GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM
AND METHODS OF USE THEREOF 

BACKGROUND OF THE INVENTION

Field of the Invention 

The present invention describes nucleic acid sequences for eukaryotic genes encoding
lycopene ε-cyclase (also known as ε-cyclase and ε lycopene cyclase), isopentenyl
pyrophosphate (IPP) isomerase and β-carotene hydroxylase, as well as vectors containing the
same and hosts transformed with said vectors. The present invention also provides methods
for augmenting the accumulation of carotenoids, changing the composition of the
carotenoids, and producing novel and rare carotenoids. The present invention provides
methods for controlling the ratio or relative amounts of various carotenoids in a host. The
invention also relates to modified lycopene ε-cyclase, IPP isomerase and β-carotene
hydroxylase. Additionally, the present invention provides a method for screening for genes
and cDNAs encoding enzymes of carotenoid biosynthesis and metabolism.

Background of the Invention

Carotenoid pigments with cyclic endgroups are essential components of the
photosynthetic apparatus in oxygenic photosynthetic organisms (e.g., cyanobacteria, algae
and plants; Goodwin, 1980). The symmetrical bicyclic yellow carotenoid pigment
β-carotene (or, in rare cases, the asymmetrical bicyclic α-carotene) is intimately associated with
the photosynthetic reaction centers and plays a vital role in protecting against potentially
lethal photooxidative damage (Koyama, 1991). β-Carotene and other carotenoids derived
from it or from α-carotene also serve as light-harvesting pigments (Siefermann-Harms,
1987), are involved in the thermal dissipation of excess light energy captured by the
light-harvesting antenna (Demmig-Adams & Adams, 1992), provide substrate for the biosynthesis
of the plant growth regulator abscisic acid (Rock & Zeevaart, 1991; Parry & Horgan, 1991),
and are precursors of vitamin A in human and animal diets (Krinsky, 1987). Plants also
exploit carotenoids as coloring agents in flowers and fruits to attract pollinators and agents of
seed dispersal (Goodwin, 1980). The color provided by carotenoids is also of agronomic
value in a number of important crops. Carotenoids are currently harvested from a variety of
organisms, including plants, algae, yeasts, cyanobacteria and bacteria, for use as pigments in
food and feed.

The probable pathway for formation of cyclic carotenoids in plants, algae and
cyanobacteria is illustrated in Figure 1. Two types of cyclic endgroups or rings are
commonly found in higher plant carotenoids; these are referred to as the β (beta) and ε
(epsilon) rings (Fig. 3). The precursor acyclic endgroup (no ring structure) is referred to as
the ψ (psi) endgroup. The β and ε endgroups differ only in the position of the double bond in
the ring. Carotenoids with two β rings are ubiquitous, and those with one β and one ε ring
are common, but carotenoids with two ε rings are uncommon. β-Carotene (Fig. 1) has two
β-endgroups and is a symmetrical compound that is the precursor of a number of other
important plant carotenoids such as zeaxanthin and violaxanthin (Fig. 2).

Genes encoding enzymes of carotenoid biosynthesis have previously been isolated
from a variety of sources including bacteria (Armstrong et al., 1989, Mol. Gen. Genet. 216,
254-268; Misawa et al., 1990, J. Bacteriol. 172, 6704-12), fungi (Schmidhauser et al., 1990,
Mol. Cell. Biol. 10, 5064-70), cyanobacteria (Chamovitz et al., 1990, Z. Naturforsch. 45c,
482-86; Cunningham et al., 1994) and higher plants (Bartley et al., Proc. Natl. Acad. Sci. USA
88, 6532-36; Martinez-Ferez & Vioque, 1992, Plant Mol. Biol. 18, 981-83). Many of the
isolated enzymes show a great diversity in structure, function and inhibitory properties
between sources. For example, phytoene desaturases from the cyanobacterium
Synechococcus and from higher plants and green algae carry out a two-step desaturation to
yield ζ-carotene as a reaction product. In plants and cyanobacteria a second enzyme
(ζ-carotene desaturase), similar in amino acid sequence to the phytoene desaturase, catalyzes
two additional desaturations to yield lycopene. In contrast, a single desaturase enzyme from
Erwinia herbicola and from other bacteria introduces all four double bonds required to form
lycopene. The Erwinia and other bacterial desaturases bear little amino acid sequence
similarity to the plant and cyanobacterial desaturase enzymes, and are thought to be of
unrelated ancestry. Therefore, even with a gene in hand from one source, it may be difficult to
identify a gene encoding an enzyme of similar function in another organism. In particular,
the sequence similarity between certain of the prokaryotic and eukaryotic genes encoding
enzymes of carotenoid biosynthesis is quite low.

Further, the mechanism of gene expression in prokaryotes and eukaryotes appears to
differ sufficiently that one cannot expect that an isolated eukaryotic gene will be
properly expressed in a prokaryotic host.

The difficulties in isolating genes encoding enzymes with similar functions are
exemplified by recent efforts to isolate the gene encoding the enzyme that catalyzes the
formation of β-carotene from the acyclic precursor lycopene. Although a gene encoding an
enzyme with this function had been isolated from a bacterium, it had not been isolated from
any photosynthetic prokaryote or from any eukaryotic organism. The isolation and
characterization of the enzyme catalyzing formation of β-carotene in the cyanobacterium
Synechococcus PCC7942 was described by the present inventors and others (Cunningham et
al., 1993 and 1994). The amino acid sequence similarity of the cyanobacterial enzyme to the
various bacterial lycopene β-cyclases is so low (ca. 18-25% overall; Cunningham et al.,
1994) that there is much uncertainty as to whether they share a common ancestry or, instead,
represent an example of convergent evolution.

The need remains for the isolation of eukaryotic and prokaryotic genes and cDNAs
encoding polypeptides involved in the carotenoid biosynthetic pathway, including those
encoding a lycopene ε-cyclase, IPP isomerase and β-carotene hydroxylase. There remains a
need for methods to enhance the production of carotenoids, to alter the composition of
carotenoids, and to reduce or eliminate carotenoid production. There also remains a need in
the art for methods for screening for genes and cDNAs encoding enzymes of carotenoid
biosynthesis and metabolism.

SUMMARY OF THE INVENTION
Accordingly, a first object of this invention is to provide purified and/or isolated
nucleic acids which encode enzymes involved in carotenoid biosynthesis; in particular,
lycopene ε-cyclase, IPP isomerase and β-carotene hydroxylase.

A second object of this invention is to provide purified and/or isolated nucleic acids
which encode enzymes which produce novel or uncommon carotenoids.

A third object of the present invention is to provide vectors containing said genes.

A fourth object of the present invention is to provide hosts transformed with said
vectors.

Another object of the present invention is to provide hosts which accumulate novel or
uncommon carotenoids or which accumulate greater amounts of specific or total carotenoids.

Another object of the present invention is to provide hosts with inhibited and/or
altered carotenoid production.

Another object of this invention is to secure the expression of eukaryotic
carotenoid-related genes in a recombinant prokaryotic host.

Yet another object of the present invention is to provide a method for screening for
eukaryotic and prokaryotic genes and cDNAs which encode enzymes involved in carotenoid
biosynthesis and metabolism.

An additional object of the invention is to provide a method for manipulating
carotenoid biosynthesis in photosynthetic organisms by inhibiting the synthesis of certain
enzymatic products to cause accumulation of precursor compounds.

Another object of the invention is to provide modified lycopene ε-cyclase, IPP
isomerase and β-carotene hydroxylase.

These and other objects of the present invention have been realized by the present
inventors as described below.

A subject of the present invention is an isolated and/or purified nucleic acid sequence
which encodes a protein having lycopene ε-cyclase, IPP isomerase or β-carotene
hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4, 14-21
or 23-27.

The invention also includes vectors which comprise any of the nucleic acid sequences 
listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of producing or enhancing the
production of a carotenoid in a host cell, comprising inserting into the host cell a vector
comprising a heterologous nucleic acid sequence which encodes a protein having
lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase enzyme activity, wherein the
heterologous nucleic acid sequence is operably linked to a promoter; and expressing the
heterologous nucleic acid sequence to produce the protein.

Yet another subject of the present invention is a method of modifying the production
of carotenoids in a host cell, the method comprising inserting into the host cell a vector
comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes
a protein which modifies lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid
sequence is operably linked to a promoter; and expressing the heterologous nucleic acid
sequence in the host cell to modify the production of the carotenoids in the host cell, relative
to the untransformed host cell.

The present invention also includes a method of expressing, in a host cell, a
heterologous nucleic acid sequence which encodes a protein having lycopene ε-cyclase,
IPP isomerase or β-carotene hydroxylase enzyme activity, the method comprising inserting
into the host cell a vector comprising the heterologous nucleic acid sequence, wherein the
heterologous nucleic acid sequence is operably linked to a promoter; and expressing the
heterologous nucleic acid sequence.

Also included is a method of expressing, in a host cell, a heterologous nucleic acid
sequence which encodes a protein which modifies lycopene ε-cyclase, IPP isomerase or
β-carotene hydroxylase enzyme activity in the host cell, relative to an untransformed host
cell, the method comprising inserting into the host cell a vector comprising the heterologous
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably linked to a
promoter; and expressing the heterologous nucleic acid sequence.

Another subject of the present invention is a method for screening for genes and 
cDNAs which encode enzymes involved in carotenoid biosynthesis and metabolism. 

BRIEF DESCRIPTION OF THE DRAWINGS

A more complete appreciation of the invention and many of the attendant advantages 
thereof will be readily obtained as the same becomes better understood by reference to the 
following detailed description when considered in connection with the accompanying 
drawings, wherein: 

Figure 1 is a schematic representation of the putative pathway of β-carotene
biosynthesis in cyanobacteria, algae and plants. The enzymes catalyzing various steps are
indicated at the left. Target sites of the bleaching herbicides NFZ and MPTA are also
indicated at the left. Abbreviations: DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl
pyrophosphate; GGPP, geranylgeranyl pyrophosphate; GPP, geranyl pyrophosphate; IPP,
isopentenyl pyrophosphate; LCY, lycopene cyclase; MVA, mevalonic acid; MPTA,
2-(4-methylphenoxy)triethylamine hydrochloride; NFZ, norflurazon; PDS, phytoene desaturase;
PSY, phytoene synthase; ZDS, ζ-carotene desaturase; PPPP, prephytoene pyrophosphate.

Figure 2 depicts possible routes of synthesis of cyclic carotenoids and common plant
and algal xanthophylls (oxycarotenoids) from neurosporene. Demonstrated activities of the
β- and ε-cyclase enzymes of A. thaliana are indicated by bold arrows labelled with β or ε
respectively. A bar below the arrow leading to ε-carotene indicates that the enzymatic
activity was examined but no product was detected. The steps marked by an arrow with a
dotted line have not been specifically examined. Conventional numbering of the carbon
atoms is given for neurosporene and α-carotene. Inverted triangles (▼) mark positions of the
double bonds introduced as a consequence of the desaturation reactions.

Figure 3 depicts the carotene endgroups which are found in plants.

Figure 4 is a DNA sequence and the predicted amino acid sequence of a lycopene
ε-cyclase cDNA isolated from A. thaliana (SEQ ID NOS: 1 and 2). These sequences were
deposited under GenBank accession number U50738. This cDNA is incorporated into the
plasmid pATeps.

Figure 5 is a DNA sequence encoding the β-carotene hydroxylase isolated from A.
thaliana (SEQ ID NO: 3). This cDNA is incorporated into the plasmid pATOHB.

Figure 6 is an alignment of the predicted amino acid sequences of A. thaliana
β-carotene hydroxylase (SEQ ID NO: 4) with those of the bacterial β-carotene hydroxylase
enzymes from Alcaligenes sp. (SEQ ID NO: 5) (GenBank D58422), Erwinia herbicola Eho10
(SEQ ID NO: 6) (GenBank M872280), Erwinia uredovora (SEQ ID NO: 7) (GenBank
D90087) and Agrobacterium aurantiacum (SEQ ID NO: 8) (GenBank D58420). A
consensus sequence is also shown. All five genes are identical where a capital letter appears
in the consensus. A lowercase letter indicates that three of five, including A. thaliana, have
the identical residue. TM, transmembrane.
Figure 7 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from A.
thaliana (SEQ ID NO: 9). This cDNA is incorporated into the plasmid pATDPS.

Figure 8 is a DNA sequence of a second cDNA encoding another IPP isomerase
isolated from A. thaliana (SEQ ID NO: 10). This cDNA is incorporated into the plasmid
pATDP7.

Figure 9 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from
Haematococcus pluvialis (SEQ ID NO: 11). This cDNA is incorporated into the plasmid
pHP04.

Figure 10 is a DNA sequence of a second cDNA encoding another IPP isomerase
isolated from Haematococcus pluvialis (SEQ ID NO: 12). This cDNA is incorporated into
the plasmid pHP05.

Figure 11 is an alignment of the amino acid sequences predicted by IPP isomerase
cDNAs isolated from A. thaliana (SEQ ID NOS: 16 and 18), H. pluvialis (SEQ ID NOS: 14
and 15), Clarkia breweri (SEQ ID NO: 17) (see Blanc & Pichersky, Plant Physiol. (1995)
108:855; GenBank accession no. X82627) and Saccharomyces cerevisiae (SEQ ID NO: 19)
(GenBank accession no. J05090).

Figure 12 is a DNA sequence of the cDNA encoding an IPP isomerase isolated from
Tagetes erecta (marigold; SEQ ID NO: 13). This cDNA is incorporated into the plasmid
pPMDP1. xxx's denote a region not originally sequenced. Figure 21A shows the complete
marigold sequence.

Figure 13 is an alignment of the consensus sequence of four plant β-cyclases (SEQ ID
NO: 20) with the A. thaliana lycopene ε-cyclase (SEQ ID NO: 21). A capital letter in the
plant β consensus is used where all four β-cyclase genes predict the same amino acid residue
in this position. A small letter indicates that an identical residue was found in three of the
four. Dashes indicate that the amino acid residue was not conserved and dots in the sequence
denote a gap. A consensus for the aligned sequences is given, in capital letters below the
alignment, where the β- and ε-cyclases have the same amino acid residue. Arrows indicate
some of the conserved amino acids that will be used as junction sites for construction of
chimeric cyclases with novel enzymatic activities. Several regions of interest, including a
sequence signature indicative of a dinucleotide-binding motif and two predicted
transmembrane (TM) helical regions, are indicated below the alignment and are underlined.

Figure 14 shows the nucleotide (SEQ ID NO:22) and amino acid sequences (SEQ ID
NO:23) of the Adonis palaestina (pheasant's eye) ε-cyclase cDNA #5.

Figure 15A shows the nucleotide (SEQ ID NO:24) and amino acid sequences (SEQ
ID NO:25) of a potato ε-cyclase cDNA. Figure 15B shows the amino acid sequence (SEQ ID
NO:26) of a chimeric lettuce/potato lycopene ε-cyclase. Amino acids in lower case are from
the lettuce cDNA and those in upper case are from the potato cDNA. The product of this
chimeric cDNA has ε-cyclase activity and converts lycopene to the monocyclic δ-carotene.

Figure 16 shows a comparison between the amino acid sequences of the Arabidopsis
ε-cyclase (SEQ ID NO:27) and the potato ε-cyclase (SEQ ID NO:25).

Figure 17A shows the nucleotide sequence of the Adonis palaestina Ipi1 (SEQ ID
NO:28) and Figure 17B shows the nucleotide sequence of the Adonis palaestina Ipi2 (SEQ
ID NO:29).

Figure 18A shows the nucleotide sequence of the Haematococcus pluvialis Ipi1 (SEQ
ID NO:11) and Figure 18B shows the nucleotide sequence of the Haematococcus pluvialis Ipi2
(SEQ ID NO:30).

Figure 19A shows the nucleotide sequence of the Lactuca sativa (romaine lettuce)
Ipi1 (SEQ ID NO:31) and Figure 19B shows the nucleotide sequence of the Lactuca sativa
Ipi2 (SEQ ID NO:32).

Figure 20 shows the nucleotide sequence of the Chlamydomonas reinhardtii Ipi1
(SEQ ID NO:33).

Figure 21A shows the nucleotide sequence of the Tagetes erecta (marigold) Ipi1 (SEQ
ID NO:34) and Figure 21B shows the nucleotide sequence of the Oryza sativa (rice) Ipi1
(SEQ ID NO:35).

Figure 22 shows an amino acid sequence alignment of various plant and green algal
isopentenyl isomerases (IPI) (SEQ ID NOS:16, 36-45).

Figure 23 shows a comparison between Adonis palaestina ε-cyclase cDNA #3 and
cDNA #5 nucleotide sequences.

Figure 24 shows a comparison between Adonis palaestina ε-cyclase cDNA #3 and
cDNA #5 predicted amino acid sequences.

Figure 25 shows a sequence alignment of various plant β- and ε-cyclases. Those
sequences outlined in grey denote identical sequences among the ε-cyclases. Those
sequences outlined in black denote identical sequences among both the β- and ε-cyclases.

Figure 26 shows a sequence alignment of the plant ε-cyclases from Figure 25. Those
sequences outlined in black denote identical sequences among the ε-cyclases.

Figure 27 is a dendrogram or "tree" illustrating the degree of amino acid sequence
similarity for various lycopene β- and ε-cyclases.

Figure 28 shows a comparison between Arabidopsis ε-cyclase and lettuce ε-cyclase
predicted amino acid sequences.

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention includes an isolated and/or purified nucleic acid sequence
which encodes a protein having lycopene ε-cyclase, IPP isomerase or β-carotene
hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4, 14-21,
23 or 25-27. Nucleic acids encoding lycopene ε-cyclase, β-carotene hydroxylase and IPP
isomerases have been isolated from several genetically distant sources.

The present inventors have isolated nucleic acids encoding the enzyme IPP isomerase,
which catalyzes the reversible conversion of isopentenyl pyrophosphate (IPP) to
dimethylallyl pyrophosphate (DMAPP). IPP isomerase cDNAs were isolated from the plants
A. thaliana, Tagetes erecta (marigold), Adonis palaestina (pheasant's eye), Lactuca sativa
(romaine lettuce) and from the green algae H. pluvialis and Chlamydomonas reinhardtii.
Alignments of the amino acid sequences predicted by some of these cDNAs are shown in
Figures 12 and 22. Plasmids containing some of these cDNAs were deposited with the
American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on March 4,
1996 under ATCC accession numbers 98000 (pHP05 - H. pluvialis); 98001 (pMDP1 -
marigold); 98002 (pATDP7 - A. thaliana) and 98004 (pHP04 - H. pluvialis).

The present inventors have also isolated nucleic acids encoding the enzyme
β-carotene hydroxylase, which is responsible for hydroxylating the β-endgroup in carotenoids.
The nucleic acid of the present invention is shown in SEQ ID NO: 3 and Figure 5. The full
length cDNA product hydroxylates both end groups of β-carotene, as do products of cDNAs
which encode proteins truncated by up to 50 amino acids from the N-terminus. Products of
genes which encode proteins truncated between about 60-110 amino acids from the
N-terminus preferentially hydroxylate only one ring. A plasmid containing this gene was
deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD
20852 on March 4, 1996 under ATCC accession number 98003 (pATOHB - A. thaliana).

The present inventors have also isolated nucleic acids encoding the enzyme lycopene
ε-cyclase, which is responsible for the formation of ε-endgroups in carotenoids. The A.
thaliana ε-cyclase adds an ε ring to only one end of the symmetrical lycopene while the
related β-cyclase adds a ring at both ends. The A. thaliana cDNA of the present invention is
shown in Figure 4 and SEQ ID NO: 1. A plasmid containing this gene was deposited with
the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on
March 4, 1996 under ATCC accession number 98005 (pATeps - A. thaliana).

In addition, lycopene ε-cyclases have been identified in lettuce and in Adonis
palaestina (cDNA #5) which encode enzymes that convert lycopene to the bicyclic
ε-carotene (ε,ε-carotene). An additional cDNA from Adonis palaestina (cDNA #3) encodes a
lycopene ε-cyclase which converts lycopene into δ-carotene (ε,ψ-carotene) and differs from
the lycopene ε-cyclase which forms bicyclic ε-carotene (ε,ε-carotene) by only 5 amino acids.
One or more of these amino acids may be modified by alteration of the nucleotide sequence
in the #5 cDNA to obtain an enzyme which forms the bicyclic ε,ε-carotene. The sequences
of the Adonis palaestina and Arabidopsis thaliana ε-cyclases have about 70% nucleotide
identity and about 72% amino acid identity.
Initial experiments by the inventors with chimeric genes indicated that the part of the
ε-cyclase which is responsible for adding 2 ε rings to form ε,ε-carotene is the carboxy
terminal portion of the gene. The lettuce ε-cyclase adds two ε rings to form ε,ε-carotene. A
DNA encoding a partial potato ε-cyclase (missing its amino terminal portion), when
combined with an amino terminal region from the lettuce ε-cyclase gene, produces a
monocyclic δ-carotene (ε,ψ-carotene). With the discovery of the differences between the
Adonis palaestina clone #3 and clone #5, the specific amino acids responsible for the addition
of an extra ε ring have been identified (Figure 24). Specifically, amino acid 55 is Thr in
clone #3 and Ser in clone #5, amino acid 210 is Asn in clone #3 and Asp in clone #5, amino
acid 231 is Asp in clone #3 and Glu in clone #5, amino acid 352 is Ile in clone #3 and Val in
clone #5, and amino acid 524 is Lys in clone #3 and Arg in clone #5. It can be appreciated
that these changes are quite conservative, as only one change, at amino acid 210, changes the
charge of the protein.
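
The charge argument above can be checked with a short sketch. The residue pairs are those listed in the text for clones #3 and #5; the side-chain charge table is a simplifying assumption made here for illustration (Asp and Glu as -1, Lys and Arg as +1, every other residue treated as neutral) and is not part of the specification.

```python
# Residue differences between Adonis palaestina clone #3 and clone #5,
# as listed in the text: position -> (residue in clone #3, residue in clone #5).
DIFFERENCES = {
    55: ("Thr", "Ser"),
    210: ("Asn", "Asp"),
    231: ("Asp", "Glu"),
    352: ("Ile", "Val"),
    524: ("Lys", "Arg"),
}

# Assumed nominal side-chain charges (illustrative simplification only).
CHARGE = {"Asp": -1, "Glu": -1, "Lys": 1, "Arg": 1}


def charge(residue: str) -> int:
    """Return the assumed side-chain charge; unlisted residues count as neutral."""
    return CHARGE.get(residue, 0)


def charge_changing_positions(differences: dict) -> list:
    """Return the sorted positions at which the two clones differ in charge."""
    return sorted(
        pos for pos, (a, b) in differences.items() if charge(a) != charge(b)
    )


print(charge_changing_positions(DIFFERENCES))
```

Under these assumptions, only position 210 (Asn to Asp) is reported, in agreement with the statement in the text.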

Thus, it is clear that the nucleic acids of the invention encoding the enzymes as
presently disclosed may be altered to increase a particularly desirable property of the enzyme,
to change a property of the enzyme, or to diminish an undesirable property of the enzyme.
Such modifications can be by deletion, substitution, or insertion of one or more amino acids,
and can be performed by routine enzymatic manipulation of the nucleic acid encoding the
enzyme (such as by restriction enzyme digestion, removal of nucleotides by mung bean
nuclease or Bal31, insertion of nucleotides by Klenow fragment, and by religation of the
ends), by site-directed mutagenesis, or may be accidental, such as by low fidelity PCR or
those obtained through mutations in hosts that are producers of the enzymes. These
techniques as well as other suitable techniques are well known in the art.

Mutations can be made in the nucleic acids of the invention such that a particular
codon is changed to a codon which codes for a different amino acid. Such a mutation is
generally made by making the fewest nucleotide changes possible. A substitution mutation
of this sort can be made to change an amino acid in the resulting protein in a
non-conservative manner (i.e., by changing the codon from an amino acid belonging to a grouping
of amino acids having a particular size or characteristic to an amino acid belonging to another
grouping) or in a conservative manner (i.e., by changing the codon from an amino acid
belonging to a grouping of amino acids having a particular size or characteristic to an amino
acid belonging to the same grouping). Such a conservative change generally leads to less
change in the structure and function of the resulting protein. A non-conservative change is
more likely to alter the structure, activity or function of the resulting protein. The present
invention should be considered to include sequences containing conservative changes which
do not significantly alter the activity or binding characteristics of the resulting protein.
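
The idea of changing a codon with the fewest nucleotide changes possible can be illustrated with a small sketch. It assumes the standard genetic code (built here from the conventional TCAG ordering); the function name `closest_codons` is hypothetical and is used only for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def closest_codons(codon: str, target_aa: str):
    """Return (n, codons): the minimum number of nucleotide changes needed to
    turn `codon` into a codon for `target_aa` (one-letter code), together with
    the sorted list of codons that achieve that minimum."""
    candidates = [c for c, aa in CODON_TABLE.items() if aa == target_aa]
    if not candidates:
        raise ValueError(f"unknown amino acid: {target_aa!r}")

    def distance(other: str) -> int:
        return sum(x != y for x, y in zip(codon, other))

    best = min(distance(c) for c in candidates)
    return best, sorted(c for c in candidates if distance(c) == best)


# Asn (AAT) to Asp: a single change at the first position is enough.
print(closest_codons("AAT", "D"))
```

For example, an Asn codon AAT can be converted to the Asp codon GAT with one nucleotide change, whereas reaching GAC would take two.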
The following is one example of various groupings of amino acids:

Amino acids with nonpolar R groups: Alanine, Valine, Leucine, Isoleucine, Proline,
Phenylalanine, Tryptophan and Methionine.

Amino acids with uncharged polar R groups: Glycine, Serine, Threonine, Cysteine, Tyrosine,
Asparagine and Glutamine.

Amino acids with charged polar R groups (negatively charged at pH 6.0): Aspartic acid and
Glutamic acid.

Basic amino acids (positively charged at pH 6.0): Lysine, Arginine and Histidine.

Another grouping may be those amino acids with phenyl groups: Phenylalanine,
Tryptophan and Tyrosine.

Another grouping may be according to molecular weight (i.e., size of R groups).

Particularly preferred substitutions are:

- Lys for Arg and vice versa such that a positive charge may be maintained;

- Glu for Asp and vice versa such that a negative charge may be maintained;

- Ser for Thr such that a free -OH can be maintained; and

- Gln for Asn such that a free NH2 can be maintained.
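
As a minimal sketch, the four charge/polarity groupings listed above can be encoded as sets and used to label a substitution as conservative or non-conservative. The group names and the helper functions `group_of` and `is_conservative` are hypothetical names chosen for this illustration; only the group memberships come from the text.

```python
# Groupings of amino acids as listed in the text (three-letter codes).
GROUPS = {
    "nonpolar": {"Ala", "Val", "Leu", "Ile", "Pro", "Phe", "Trp", "Met"},
    "uncharged polar": {"Gly", "Ser", "Thr", "Cys", "Tyr", "Asn", "Gln"},
    "acidic": {"Asp", "Glu"},
    "basic": {"Lys", "Arg", "His"},
}


def group_of(residue: str) -> str:
    """Return the name of the grouping that contains the residue."""
    for name, members in GROUPS.items():
        if residue in members:
            return name
    raise ValueError(f"unknown residue: {residue!r}")


def is_conservative(original: str, replacement: str) -> bool:
    """A substitution is treated as conservative if both residues share a grouping."""
    return group_of(original) == group_of(replacement)


for pair in [("Lys", "Arg"), ("Glu", "Asp"), ("Ser", "Thr"), ("Gln", "Asn"), ("Asn", "Asp")]:
    print(pair, is_conservative(*pair))
```

With this grouping, each of the particularly preferred substitutions is classified as conservative, while Asn to Asp is not, because it moves from the uncharged polar group to the acidic group.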

Amino acid substitutions may also be introduced to substitute an amino acid with a
particularly preferable property. For example, a Cys may be introduced to provide a potential
site for disulfide bridges with another Cys. A His may be introduced as a particularly
"catalytic" site (i.e., His can act as an acid or base and is the most common amino acid in
biochemical catalysis). Pro may be introduced because of its particularly planar structure,
which induces β-turns in the protein's structure.

It is clear that certain modifications of SEQ ID NOS: 2, 4, 14-21, 23 or 25-27 can take
place without destroying the activity of the enzyme. It is noted especially that truncated
versions of the nucleic acids of the invention are functional. For example, several amino
acids (from 1 to about 120) can be deleted from the N-terminus of the lycopene ε-cyclases of
the invention, and a functional protein can still be produced. This fact is made especially
clear from Figure 25, which shows a sequence alignment of several plant ε-cyclases. As can
be seen from Figure 25, there is an enormous amount of sequence disparity between amino
acid sequences 2 to about 50-70 (depending on the particular sequence, since gaps are
present). There is less, but also a substantial amount of, sequence dissimilarity between about
50-70 to about 90-120 (depending on the particular sequence). Thereafter, the sequences are
fairly conserved, except for small pockets of dissimilarity between about 275-295 to about
285-305 (depending on the particular sequence), and between about 395-415 to about
410-430 (depending on the particular sequence).

The present inventors have found that the amount of the 5' region present in the
nucleic acids of the invention can alter the activity of the enzyme. Instead of diminishing
activity, truncating the 5' region of the nucleic acids of the invention may result in an enzyme
with a different specificity. Thus, the present invention relates to nucleic acids and enzymes
encoded thereby which are truncated to within 0-50, preferably 0-25, codons of the 5'
initiation codon of their prokaryotic counterparts as determined by alignment maps as
discussed below.

For example, when the cDNA encoding A. thaliana β-carotene hydroxylase was
truncated, the resulting enzyme catalyzed the formation of β-cryptoxanthin as the major
product and zeaxanthin as minor product, in contrast to its normal production of zeaxanthin.

The present invention is intended to include those nucleic acid and amino acid
sequences in which substitutions, deletions, additions or other modifications have taken
place, as compared to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27, without destroying the activity
of the enzyme. Preferably, the substitutions, deletions, additions or other modifications take
place at the 5' end, or any other of those positions which already show dissimilarity between
any of the presently disclosed amino acid sequences (see also Figure 25) or other amino acid
sequences which are known in the art and which encode the same enzyme (i.e., lycopene
ε-cyclase, IPP isomerase or β-carotene hydroxylase).
In each case, nucleic acid and amino acid sequence similarity and identity are measured
using sequence analysis software, for example, the Sequence Analysis, Gap, or BestFit
software packages of the Genetics Computer Group (University of Wisconsin Biotechnology
Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAlign (DNAStar, Inc.,
1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105
S. Bascom Avenue, Suite 200, Campbell, California 95008). Such software uses algorithms
to match similar sequences by assigning degrees of identity to various substitutions,
deletions, and other modifications, and includes detailed instructions as to useful parameters,
etc., such that those of routine skill in the art can easily compare sequence similarities and
identities. An example of a useful algorithm in this regard is the algorithm of Needleman and
Wunsch, which is used in the Gap program discussed above. This program finds the
alignment of two complete sequences that maximizes the number of matches and minimizes
the number of gaps. Another useful algorithm is the algorithm of Smith and Waterman,
which is used in the BestFit program discussed above. This program creates an optimal
alignment of the best segment of similarity between two sequences. Optimal alignments are
found by inserting gaps to maximize the number of matches using the local homology
algorithm of Smith and Waterman.
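By way of illustration only, the global alignment approach of Needleman and Wunsch described above may be sketched in Python as follows. The function names and the scoring values (match +1, mismatch -1, gap -1) are assumptions chosen for the sketch and are not the defaults of the Gap program; percent identity is computed here as identical positions divided by alignment length, which is one of several conventions in use.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align sequences a and b.

    Returns (aligned_a, aligned_b, score). Scoring values are
    illustrative assumptions, not those of any commercial package.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )
    # Trace back from the bottom-right corner to recover one optimal alignment
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]


def percent_identity(aligned_a, aligned_b):
    """Identical, non-gap positions as a percentage of alignment length."""
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)
```

A local alignment in the manner of Smith and Waterman differs mainly in that cell scores are floored at zero and the traceback starts from the highest-scoring cell rather than the corner.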

Conservative (i.e., similar) substitutions typically include substitutions within the
following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid, glutamic
acid, asparagine and glutamine; serine and threonine; lysine and arginine; and phenylalanine
and tyrosine. Substitutions may also be made on the basis of conserved hydrophobicity or
hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132 (1982)), or on the basis of
the ability to assume similar polypeptide secondary structure (see Chou and Fasman, Adv.
Enzymol. 47: 45-148 (1978)).
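The substitution groups listed above can be encoded directly as sets of one-letter amino acid codes. The following Python sketch is illustrative only: the function names and the example peptides are hypothetical, and the sketch assumes two sequences of equal length that have already been aligned without gaps.

```python
# Conservative substitution groups as listed in the text (one-letter codes)
CONSERVATIVE_GROUPS = [
    set("GA"),    # glycine, alanine
    set("VIL"),   # valine, isoleucine, leucine
    set("DENQ"),  # aspartic acid, glutamic acid, asparagine, glutamine
    set("ST"),    # serine, threonine
    set("KR"),    # lysine, arginine
    set("FY"),    # phenylalanine, tyrosine
]


def is_conservative(res_a, res_b):
    """True if res_a -> res_b is a substitution within one listed group."""
    res_a, res_b = res_a.upper(), res_b.upper()
    if res_a == res_b:
        return False  # identical residues are not a substitution
    return any(res_a in g and res_b in g for g in CONSERVATIVE_GROUPS)


def classify_substitutions(reference, variant):
    """List (position, ref, var, kind) for every differing position.

    Positions are 1-based. Both sequences must have the same length.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be pre-aligned to equal length")
    changes = []
    for pos, (r, v) in enumerate(zip(reference, variant), start=1):
        if r != v:
            kind = "conservative" if is_conservative(r, v) else "non-conservative"
            changes.append((pos, r, v, kind))
    return changes
```

For the hypothetical peptides `MKVLS` and `MRVWS`, the lysine-to-arginine change at position 2 is reported as conservative and the leucine-to-tryptophan change at position 4 as non-conservative.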

If comparison is made between nucleotide sequences, preferably the length of
comparison sequences is at least 50 nucleotides, more preferably at least 60 nucleotides, at
least 75 nucleotides or at least 100 nucleotides. It is most preferred if comparison is made
between the nucleic acid sequences encoding the enzyme coding regions necessary for
enzyme activity. If comparison is made between amino acid sequences, preferably the length
of comparison is at least 20 amino acids, more preferably at least 30 amino acids, at least 40
amino acids or at least 50 amino acids. It is most preferred if comparison is made between
the amino acid sequences in the enzyme coding regions necessary for enzyme activity.

It should be appreciated that also within the scope of the present invention are nucleic
acid sequences encoding lycopene ε-cyclases, IPP isomerases and β-carotene hydroxylases
which code for enzymes having the same amino acid sequence as SEQ ID NOS: 2, 4, 14-21,
23 or 25-27, but which are degenerate to the nucleic acids specifically disclosed herein.
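Degeneracy in this sense can be illustrated with a short Python sketch that translates two different DNA sequences with the standard genetic code and confirms that they encode the same peptide. The two 12-nucleotide sequences are hypothetical examples and are not taken from the sequence listing; the function names are likewise assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna):
    """Translate a coding sequence (length divisible by 3) to one-letter codes."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def are_degenerate(dna_a, dna_b):
    """True if the sequences differ at the DNA level but encode the same protein."""
    return dna_a.upper() != dna_b.upper() and translate(dna_a) == translate(dna_b)


# Hypothetical example: both sequences encode Met-Lys-Leu-Ser
SEQ_A = "ATGAAACTGTCT"
SEQ_B = "ATGAAGTTATCA"
```

Here `SEQ_A` and `SEQ_B` differ at four nucleotide positions yet `are_degenerate(SEQ_A, SEQ_B)` holds, because each changed codon is a synonym for the same amino acid.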

The amino acid residues described herein are preferred to be in the "L" isomeric form.
However, residues in the "D" isomeric form can be substituted for any L-amino acid residue,
as long as the desired functional property of immunoglobulin-binding is retained by the
polypeptide.

In accordance with the present invention there may be employed conventional
molecular biology, microbiology, and recombinant DNA techniques within the skill of the
art. Such techniques are explained fully in the literature. See, e.g., Sambrook et al.,
"Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular
Biology" Volumes I-III [Ausubel, R. M., ed. (1994)]; "Cell Biology: A Laboratory
Handbook" Volumes I-III [J. E. Celis, ed. (1994)]; "Current Protocols in Immunology"
Volumes I-III [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M.J. Gait ed. 1984);
"Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; "Transcription And
Translation" [B.D. Hames & S.J. Higgins, eds. (1984)]; "Animal Cell Culture" [R.I.
Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A
Practical Guide To Molecular Cloning" (1984).

The present invention also includes vectors. Suitable vectors according to the present
invention comprise a nucleic acid of the invention encoding an enzyme involved in
carotenoid biosynthesis or metabolism and a suitable promoter for the host, and can be
constructed using techniques well known in the art (for example Sambrook et al., Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY,
1989; Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing and Wiley
Interscience, New York, 1991). Suitable vectors for eukaryotic expression in plants are
described in Frey et al., Plant J. (1995) 8(5):693 and Misawa et al., 1994a; incorporated herein
by reference. Suitable vectors for prokaryotic expression include pACYC184, pUC119, and
pBR322 (available from New England BioLabs, Beverly, MA) and pTrcHis (Invitrogen) and
pET28 (Novagen) and derivatives thereof. The vectors of the present invention can
additionally contain regulatory elements such as promoters, repressors, selectable markers
such as antibiotic resistance genes, etc.

The nucleic acids encoding the carotenoid enzymes as described above, when cloned
into a suitable expression vector, can be used to overexpress these enzymes in a plant
expression system or to inhibit the expression of these enzymes. For example, a vector
containing the gene encoding lycopene ε-cyclase can be used to increase the amount of
α-carotene and carotenoids derived from α-carotene (such as lutein and α-cryptoxanthin) in an
organism and thereby alter the nutritional value, pharmacology and visual appearance value
of the organism.

Therefore, the present invention includes a method of producing or enhancing the
production of a carotenoid in a host cell, relative to an untransformed host cell, the method
comprising inserting into the host cell a vector comprising a heterologous nucleic acid
sequence which encodes for a protein having lycopene ε-cyclase, IPP isomerase or
β-carotene hydroxylase enzyme activity, wherein the heterologous nucleic acid sequence is
operably linked to a promoter; and expressing the heterologous nucleic acid sequence to
produce the protein.

The present invention also includes a method of modifying the production of
carotenoids in a host cell, the method comprising inserting into the host cell a vector
comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for
a protein which modifies lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid
sequence is operably linked to a promoter; and expressing the heterologous nucleic acid
sequence in the host cell to modify the production of the carotenoids in the host cell, relative
to the untransformed host cell.

The term "modifying the production" means that the amount of carotenoids produced
in the host cell can be enhanced, reduced, or left the same, as compared to the untransformed
host cell. In accordance with one embodiment of the present invention, the make-up of the
carotenoids (i.e., the specific carotenoids produced) is changed vis a vis each other, and this
change in make-up may result in either a net gain, net loss, or no net change in the total
amount of carotenoids produced in the cell. In accordance with another embodiment of the
present invention, the production or the biochemical activity of the carotenoids (or the
enzymes which catalyze their formation) is enhanced by the insertion of an enzyme-encoding
nucleic acid of the invention. In yet another embodiment of the invention, the production or
the biochemical activity of the carotenoids (or the enzymes which catalyze their formation)
may be reduced or inhibited by a number of different approaches available to those skilled in
the art, including but not limited to such methodologies or approaches as anti-sense (e.g.,
Gray et al. (1992) Plant Mol. Biol. 19:69-87), ribozymes (e.g., Wegener et al. (1994) Mol.
Gen. Genet. 245:465-470), co-suppression (e.g., Fray and Grierson (1993) Plant Mol. Biol.
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. (1997) Plant J.
11:1195-1206), intracellular antibodies (e.g., Rondon and Marasco (1997) Ann. Rev. Microbiol.
51:257-283) or whatever other approaches rely on the knowledge or availability of the nucleic
acid or amino acid sequences of the invention and/or portions thereof, to thereby reduce
accumulation of carotenoids with ε rings and compounds derived from them (for ε-cyclase
inhibition), or carotenoids with hydroxylated β rings and compounds derived from them (for
β-hydroxylase inhibition), or, in the case of IPP isomerase, accumulation of any isoprenoid
compound.

Preferably, at least a portion of the nucleic acid sequences used in the methods,
vectors and host cells of the invention codes for an enzyme having an amino acid sequence
which is at least 85% identical, preferably at least 90%, at least 95% or completely identical
to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27. Sequence identity is determined as noted above.
Preferably, sequence additions, deletions or other modifications are made as indicated above,
so as to not affect the function of the particular enzyme.

In a preferred embodiment, vectors are manufactured which contain a DNA encoding
a eukaryotic IPP isomerase upstream of a DNA encoding a second eukaryotic carotenoid
enzyme. The inventors have discovered that inclusion of an IPP isomerase gene increases the
supply of substrate for the carotenoid pathway, thereby enhancing the production of
carotenoid endproducts, as compared to a host cell which is not transformed with such a
vector. This is apparent from the much deeper pigmentation in carotenoid-accumulating
colonies of E. coli which also contain one of the aforementioned IPP isomerase genes when
compared to colonies that lack this additional IPP isomerase gene. Similarly, a vector
comprising an IPP isomerase gene can be used to enhance production of any secondary
metabolite of dimethylallyl pyrophosphate and/or isopentenyl pyrophosphate (such as
isoprenoids, steroids, carotenoids, etc.). The term "isoprenoid" is intended to mean any
member of the class of naturally occurring compounds whose carbon skeletons are composed,
in part or entirely, of isopentyl C5 units. Preferably, the carbon skeleton is of an essential oil,
a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel.

A vector containing the cDNA encoding a lycopene ε-cyclase of the invention,
preferably the lettuce lycopene ε-cyclase or Adonis ε-cyclase #5, can be used to increase the
amount of bicyclic ε-carotene in an organism and thereby alter the nutritional value,
pharmacology and visual appearance value of the organism. In addition, the transformed
organism can be used in the formulation of therapeutic agents, for example in the treatment of
cancer (see Mayne et al. (1996) FASEB J. 10:690-701; Tsushima et al. (1995) Biol. Pharm.
Bull. 18:227-233).

An antisense strand of a nucleic acid of the invention can be inserted into a vector.
For example, the lycopene ε-cyclase gene can be inserted into a vector and incorporated into
the genomic DNA of a host, thereby inhibiting the synthesis of ε,β-carotenoids (lutein and
α-carotene) and enhancing the synthesis of β,β-carotenoids (zeaxanthin and β-carotene).
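An antisense strand is the reverse complement of the sense (coding) strand, so that the RNA transcribed from an insert placed in inverted orientation can base-pair with the endogenous mRNA. The following Python sketch is illustrative only; the function names and the six-nucleotide example are hypothetical and no sequence of the invention is reproduced.

```python
# Translation table mapping each DNA base to its complement (case preserved)
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(sense_dna):
    """RNA complementary to the mRNA of sense_dna, written 5' to 3'.

    Corresponds to transcribing an insert cloned in inverted orientation.
    """
    return reverse_complement(sense_dna).upper().replace("T", "U")
```

For the hypothetical sense fragment `ATGGCC` (mRNA `AUGGCC`), `antisense_rna` returns `GGCCAU`, which pairs with the mRNA over its full length.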

The present invention also relates to novel enzymes which are encoded by the amino
acid sequences of the invention, or portions thereof.

The present invention also relates to novel enzymes which can transform known
carotenoids into novel or uncommon products. Currently ε-carotene (see Figure 2) and
γ-carotene are commonly produced only in minor amounts. As described below, an enzyme
can be produced which transforms lycopene to γ-carotene and lycopene to ε-carotene. With
these products in hand, bulk synthesis of other carotenoids derived from them is possible.
For example, ε-carotene can be hydroxylated to form lactucaxanthin, an isomer of lutein (one
ε and one β ring) and zeaxanthin (two β rings) where both endgroups are, instead, ε rings.

In addition to novel enzymes produced by truncating the 5' region of known enzymes,
as discussed above, novel enzymes which can participate in the formation of unusual
carotenoids can be formed by replacing portions of one gene with an analogous sequence
from a structurally related gene. For example, β-cyclase and ε-cyclase are structurally
related (see Figure 13). By replacing a portion of β-lycopene cyclase with the analogous
portion of ε-cyclase, an enzyme which produces γ-carotene will be produced (one β
endgroup). Further, by replacing a portion of the lycopene ε-cyclase with the analogous
portion of β-cyclase, an enzyme which produces ε-carotene will be produced (with some
exceptions, such as the lettuce ε-cyclase, plant ε-cyclases normally produce a compound with
one ε-endgroup, δ-carotene). Similarly, β-hydroxylase could be modified to produce
enzymes of novel function by creation of hybrids with ε-hydroxylase.

Host systems according to the present invention can comprise any organism that
already produces carotenoids or which has been genetically modified to produce carotenoids.
The IPP isomerase genes are more broadly applicable for enhancing production of any
product dependent on DMAPP and/or IPP as a precursor.

Organisms which already produce carotenoids include plants, algae, some yeasts, 
fungi and cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
with vectors according to the present invention can be done using standard techniques such as 
those described in Misawa et al., (1990) supra; Hundle et al., (1993) supra; Hundle et al., 
(1991) supra; Misawa et al., (1991) supra; Sandmann et al., supra; and Schnurr et al., supra. 

Transgenic organisms can be constructed which include the nucleic acid sequences of
the present invention (Bird et al., 1991; Bramley et al., 1992; Misawa et al., 1994a; Misawa et
al., 1994b; Cunningham et al., 1993). The incorporation of these sequences can allow the
controlling of carotenoid biosynthesis, content, or composition in the host cell. These
transgenic systems can be constructed to incorporate sequences which allow for the
overexpression of the nucleic acids of the present invention. Transgenic systems can also be
constructed containing antisense expression of the nucleic acid sequences of the present
invention. Such antisense expression would result in the accumulation of the substrates of the
enzyme encoded by the sense strand.

A method for screening for eukaryotic genes which encode enzymes involved in 
carotenoid biosynthesis comprises transforming a prokaryotic host with a nucleic acid which 
may contain a eukaryotic or prokaryotic carotenoid biosynthetic gene; culturing said 
transformed host to obtain colonies; and screening for colonies exhibiting a different color 
than colonies of the untransformed host. 

Suitable hosts include E. coli, cyanobacteria such as Synechococcus and
Synechocystis, algae and plant cells. E. coli is preferred.

In a preferred embodiment, the above "color complementation" screening protocol can 
be enhanced by using mutants which are either (1) deficient in at least one carotenoid 
biosynthetic gene or (2) overexpress at least one carotenoid biosynthetic gene. In either case, 
such mutants will accumulate carotenoid precursors. 

Prokaryotic and eukaryotic DNA or cDNA libraries can be screened in total for the
presence of genes of carotenoid biosynthesis, metabolism and degradation. Preferred
organisms to be screened include photosynthetic organisms.

E. coli can be transformed with these eukaryotic cDNA libraries using conventional
methods such as those described in Sambrook et al., 1989 and according to protocols
described by the vendors of the cloning vectors.

For example, the cDNA libraries in bacteriophage vectors such as lambda ZAP
(Stratagene) or lambda ZIPLOX (Gibco BRL) can be excised en masse and used to transform
E. coli.

Transformed E. coli can be cultured using conventional techniques. The culture broth
preferably contains antibiotics to select and maintain plasmids. Suitable antibiotics include
penicillin, ampicillin, chloramphenicol, etc. Culturing is typically conducted at 15-40°C,
preferably at room temperature or slightly above (18-28°C), for 12 hours to 7 days.

Cultures are plated and the plates are screened visually for colonies with a different
color than the colonies of the host E. coli transformed with the empty plasmid cloning vector.
For example, E. coli transformed with the plasmid pAC-BETA (described below) produce
yellow colonies that accumulate β-carotene. After transformation with a cDNA library,
colonies which contain a different hue than those formed by E. coli/pAC-BETA would be
expected to contain enzymes which modify the structure or accumulation of β-carotene.
Similar E. coli strains can be engineered which accumulate earlier products in carotenoid
biosynthesis, such as lycopene, γ-carotene, etc.

Having generally described this invention, a further understanding can be obtained by
reference to certain specific examples which are provided herein for purposes of illustration
only and are not intended to be limiting unless otherwise specified.

EXAMPLE 

I. Isolation of β-carotene hydroxylase
Plasmid Construction

An 8.6 kb BglII fragment containing the carotenoid biosynthetic genes of Erwinia
herbicola was first cloned in the BamHI site of plasmid vector pACYC184 (chloramphenicol
resistant), and then a 1.1 kb BamHI fragment containing the E. herbicola β-carotene
hydroxylase (CrtZ) was deleted. E. coli strains containing the resulting plasmid, pAC-BETA,
accumulate β-carotene and form yellow colonies (Cunningham et al., 1994).

A full length cDNA encoding IPP isomerase of Haematococcus pluvialis (HP04) was
first excised with BamHI and KpnI from pBluescript SK-, and then ligated into the
corresponding sites of the pTrcHisA vector with high-level expression from the trc promoter
(Invitrogen, Inc.). A fragment containing the IPP isomerase and trc promoter was
subsequently excised with EcoRV and KpnI, treated with the Klenow fragment of DNA
polymerase to produce blunt ends, and ligated in the Klenow-treated HindIII site of
pAC-BETA. E. coli cells transformed with this new plasmid pAC-BETA-04 form orange colonies
on LB plates (vs. yellow for those containing pAC-BETA) and cultures accumulate
substantially more β-carotene (ca. two fold) than those that contain pAC-BETA.

Screening of an Arabidopsis cDNA Library 

Several λ cDNA expression libraries of Arabidopsis were obtained from the
Arabidopsis Biological Resource Center (Ohio State University, Columbus, OH) (Kieber et
al., 1993). The λ cDNA libraries were excised in vivo using Stratagene's ExAssist SOLR
system to produce a phagemid cDNA library wherein each phagemid contained also a gene
conferring resistance to the antibiotic ampicillin.

E. coli strain DH10BZIP was chosen as the host cell for the screening and pigment
production, although we have also used TOP10 F' and XL1-Blue for this purpose. DH10B
cells were transformed with plasmid pAC-BETA-04 and were plated on LB agar plates
containing chloramphenicol at 50 µg/ml (from United States Biochemical Corporation). The
phagemid Arabidopsis cDNA library was then introduced into DH10B cells already
containing pAC-BETA-04. Transformed cells containing both pAC-BETA-04 and
Arabidopsis cDNA library phagemids were selected on chloramphenicol plus ampicillin (150
µg/ml) agar plates. Maximum color development occurred after 3 to 7 days incubation at
room temperature, and the rare bright yellow colonies were selected from a background of
many thousands of orange colonies on each agar plate. Selected colonies were inoculated
into 3 ml liquid LB medium containing ampicillin and chloramphenicol, and cultures were
incubated at room temperature for 1-2 days, with shaking. Cells were then harvested by
centrifugation and extracted with acetone in microfuge tubes. After centrifugation, the
pigmented extract was spotted onto silica gel thin-layer chromatography (TLC) plates, and
developed with a hexane:ether (1:1, by volume) mobile phase. β-carotene
hydroxylase-encoding cDNAs were identified based on the appearance of a yellow pigment that
co-migrated with zeaxanthin on the TLC plates.

Subcloning and Sequencing

The plasmid containing the β-carotene hydroxylase cDNA was recovered and
analyzed by standard procedures (Sambrook et al., 1989). The Arabidopsis β-carotene
hydroxylase was sequenced completely on both strands on an automatic sequencer (Applied
Biosystems, Model 373A, Version 2.0.1S). The cDNA insert of 0.95 kb also was excised
and ligated into a pTrcHis vector. A BglII restriction site within the cDNA was used to
remove that portion of the cDNA that encodes the predicted polypeptide N terminal sequence
region that is not also found in bacterial β-carotene hydroxylases (Figure 6). A BglII-XhoI
fragment was directionally cloned in BamHI-XhoI digested TrcHis vectors.
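Cloning a BglII-XhoI fragment into a BamHI-XhoI digested vector works because BglII (A^GATCT) and BamHI (G^GATCC) leave the same 5'-GATC overhang, while the XhoI end (C^TCGAG) fixes the orientation. The Python sketch below illustrates this with the well-known recognition sequences of the three enzymes; the function names and the test sequence are hypothetical, and the overhang calculation assumes a palindromic site cut symmetrically to leave a 5' overhang.

```python
# Recognition site and top-strand cut offset (bases 5' of the cut) for each enzyme
ENZYMES = {
    "BglII": ("AGATCT", 1),  # A^GATCT
    "BamHI": ("GGATCC", 1),  # G^GATCC
    "XhoI": ("CTCGAG", 1),   # C^TCGAG
}


def overhang(enzyme):
    """Single-stranded 5' overhang left by a symmetric cut in a palindromic site."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(enzyme_a, enzyme_b):
    """True if the two enzymes leave identical cohesive ends."""
    return overhang(enzyme_a) == overhang(enzyme_b)


def find_sites(sequence, enzyme):
    """0-based start positions of every occurrence of the recognition site."""
    site, _ = ENZYMES[enzyme]
    sequence = sequence.upper()
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        start = sequence.find(site, start + 1)
    return positions
```

A BglII end ligated to a BamHI end yields the hybrid sequence AGATCC, which is recognized by neither enzyme, so the junction cannot be re-cut with either one.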

Pigment Analysis

A single colony was used to inoculate 50 ml of LB containing ampicillin and
chloramphenicol in a 250-ml flask. Cultures were incubated at 28°C for 36 hours with gentle
shaking, and then harvested at 5000 rpm in an SS-34 rotor. The cells were washed once with
distilled H2O and resuspended with 0.5 ml of water. The extraction procedures and HPLC
were essentially as described previously (Cunningham et al., 1994).

II. Isolation and biochemical analysis of an Arabidopsis lycopene ε-cyclase
Plasmid Construction

Construction of plasmids pAC-LYC, pAC-NEUR, and pAC-ZETA is described in
Cunningham et al. (1994). In brief, the appropriate carotenoid biosynthetic genes from
Erwinia herbicola, Rhodobacter capsulatus, and Synechococcus sp. strain PCC7942 were
cloned in the plasmid vector pACYC184 (New England BioLabs, Beverly, MA). Cultures of
E. coli containing the plasmids pAC-ZETA, pAC-NEUR, and pAC-LYC accumulate
ζ-carotene, neurosporene, and lycopene, respectively. The plasmid pAC-ZETA was
constructed as follows: an 8.6-kb BglII fragment containing the carotenoid biosynthetic genes
of E. herbicola (GenBank M87280; Hundle et al., 1991) was obtained after partial digestion
of plasmid pPL376 (Perry et al., 1986; Tuveson et al., 1986) and cloned in the BamHI site of
pACYC184 to give the plasmid pAC-EHER. Deletion of adjacent 0.8- and 1.1-kb
BamHI-BamHI fragments (deletion Z in Cunningham et al., 1994), and of a 1.1 kb SalI-SalI fragment
(deletion X) served to remove most of the coding regions for the E. herbicola β-carotene
hydroxylase (crtZ gene) and zeaxanthin glucosyltransferase (crtX gene), respectively. The
resulting plasmid, pAC-BETA, retains functional genes for geranylgeranyl pyrophosphate
synthase (crtE), phytoene synthase (crtB), phytoene desaturase (crtI), and lycopene cyclase
(crtY). Cells of E. coli containing this plasmid form yellow colonies and accumulate
β-carotene. A plasmid containing both the lycopene ε- and β-cyclase cDNAs of A. thaliana
was constructed by excising the ε-cyclase in clone y2 as a PvuI-PvuII fragment and ligating
this piece in the SnaBI site of a plasmid (pSPORT 1 from GIBCO-BRL) that already
contained the β-cyclase (Cunningham et al., 1996).

Organisms and Growth Conditions

E. coli strains TOP10 and TOP10 F' (obtained from Invitrogen Corporation, San
Diego, CA) and XL1-Blue (Stratagene) were grown in Luria-Bertani (LB) medium
(Sambrook et al., 1989) at 37°C in darkness on a platform shaker at 225 cycles per min.
Media components were from Difco (yeast extract and tryptone) or Sigma (NaCl).
Ampicillin at 150 µg/mL and/or chloramphenicol at 50 µg/mL (both from United States
Biochemical Corporation) were used, as appropriate, for selection and maintenance of
plasmids.

Mass Excision and Color Complementation Screening of an A. thaliana cDNA Library

A size-fractionated 1-2 kb cDNA library of A. thaliana in lambda ZAPII (Kieber et
al., 1993) was obtained from the Arabidopsis Biological Resource Center at The Ohio State
University (stock number CD4-14). Other size fractionated libraries were also obtained
(stock numbers CD4-13, CD4-15, and CD4-16). An aliquot of each library was treated to
cause a mass excision of the cDNAs and thereby produce a phagemid library according to the
instructions provided by the supplier of the cloning vector (Stratagene; E. coli strain
XL1-Blue and the helper phage R408 were used). The titre of the excised phagemid was
determined and the library was introduced into a lycopene-accumulating strain of E. coli
TOP10 F' (this strain contained the plasmid pAC-LYC) by incubation of the phagemid with
the E. coli cells for 15 min at 37°C. Cells had been grown overnight at 30°C in LB medium
supplemented with 2% (w/v) maltose and 10 mM MgSO4 (final concentration), and harvested
in 1.5 ml microfuge tubes at a setting of 3 on an Eppendorf microfuge (5415C) for 10 min.
The pellets were resuspended in 10 mM MgSO4 to a volume equal to one-half that of the
initial culture volume. Transformants were spread on large (150 mm diameter) LB agar petri
plates containing antibiotics to provide for selection of cDNA clones (ampicillin) and
maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming units
were spread on each plate. Petri plates were incubated at 37°C for 16 hr and then at room
temperature for 2 to 7 days to allow maximum color development. Plates were screened
visually with the aid of an illuminated 3x magnifier and a low power stage-dissecting
microscope for the rare, pale pinkish-yellow to deep-yellow colonies that could be observed
in the background of pink colonies. A colony color of yellow or pinkish-yellow was taken as
presumptive evidence of a cyclization activity. These yellow colonies were collected with
sterile toothpicks and used to inoculate 3 ml of LB medium in culture tubes with overnight
growth at 37°C and shaking at 225 cycles/min. Cultures were split into two aliquots in
microfuge tubes and harvested by centrifugation at a setting of 5 in an Eppendorf 5415C
microfuge. After discarding the liquid, one pellet was frozen for later purification of plasmid
DNA. To the second pellet was added 1.5 ml EtOH, and the pellet was resuspended by
vortex mixing, and extraction was allowed to proceed in the dark for 15-30 min with
occasional remixing. Insoluble materials were pelleted by centrifugation at maximum speed
for 10 min in a microfuge. Absorption spectra of the supernatant fluids were recorded from
350-550 nm with a Perkin Elmer lambda six spectrophotometer.
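The supplements named in the protocol above, 2% (w/v) maltose and 10 mM MgSO4 (final concentration), convert to weighed amounts by simple arithmetic. The Python sketch below is a worked illustration only: the 100 ml culture volume is hypothetical, and the molar mass used assumes anhydrous MgSO4 (approximately 120.37 g/mol), since the text does not state which salt form was used. A hydrated salt would require a correspondingly larger mass.

```python
# Assumption: anhydrous magnesium sulfate; the source does not specify the hydrate
MGSO4_ANHYDROUS_G_PER_MOL = 120.37


def grams_for_percent_wv(percent, volume_ml):
    """Grams of solute for a percent (w/v) solution: g per 100 ml times volume."""
    return percent / 100.0 * volume_ml


def grams_for_millimolar(conc_mm, volume_ml, g_per_mol):
    """Grams of solute for a given millimolar concentration and volume."""
    moles = (conc_mm / 1000.0) * (volume_ml / 1000.0)
    return moles * g_per_mol


# Hypothetical 100 ml overnight culture
maltose_g = grams_for_percent_wv(2.0, 100.0)
mgso4_g = grams_for_millimolar(10.0, 100.0, MGSO4_ANHYDROUS_G_PER_MOL)
```

Under these assumptions a 100 ml culture takes 2 g of maltose and about 0.12 g of anhydrous MgSO4.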

Analysis of isolated clones

Eight of the yellow colonies contained β-carotene indicating that a single gene
product catalyzes both cyclizations required to form the two β endgroups of the symmetrical
β-carotene from the symmetrical precursor lycopene. One of the yellow colonies contained a
pigment with the spectrum characteristic of δ-carotene, a monocyclic carotenoid with a single
ε endgroup. Unlike the β cyclase, this ε-cyclase appears unable to carry out a second
cyclization at the other end of the molecule.

The observation that ε-cyclase is unable to form two cyclic ε-endgroups (e.g. the
bicyclic ε-carotene) illuminates the mechanism by which plants can coordinate and control
the flow of substrate into carotenoids derived from β-carotene versus those derived from
α-carotene and also can prevent the formation of carotenoids with two ε endgroups.

The availability of the A. thaliana gene encoding the ε-cyclase enables the directed
manipulation of plant and algal species for modification of carotenoid content and
composition. Through inactivation of the ε-cyclase, whether at the gene level by deletion of
the gene or by insertional inactivation or by reduction of the amount of enzyme formed (such
as by antisense technology), one may increase the formation of β-carotene and other
pigments derived from it. Since vitamin A is derived only from carotenoids with β
endgroups, an enhancement of the production of β-carotene versus α-carotene may enhance
nutritional value of crop plants. Reduction of carotenoids with ε-endgroups may also be of
value in modifying the color properties of crop plants and specific tissues of these plants.
Alternatively, where production of α-carotene, or pigments such as lutein that are derived
from α-carotene, is desirable, whether for the color properties, nutritional value or other
reason, one may overexpress the ε-cyclase or express it in specific tissues. Wherever
agronomic value of a crop is related to pigmentation provided by carotenoid pigments the
directed manipulation of expression of the ε-cyclase gene and/or production of the enzyme
may be of commercial value.

The predicted amino acid sequence of the A. thaliana ε-cyclase enzyme was
determined. A comparison of the amino acid sequences of the β- and ε-cyclase enzymes of
Arabidopsis thaliana (Fig. 13) as predicted by the DNA sequence of the respective cDNAs
(Fig. 4 for the ε-cyclase cDNA sequence), indicates that these two enzymes have many
regions of sequence similarity, but they are only about 37% identical overall at the amino acid
level. The degree of sequence identity at the DNA base level, only about 50%, is sufficiently
low such that we and others have been unable to detect this gene by hybridization using the
β-cyclase as a probe in DNA gel blot experiments.

We claim:

1. An isolated and/or purified nucleic acid sequence which encodes for a protein having
lycopene ε-cyclase enzyme activity and has an amino acid sequence which is at least 85%
identical to one of SEQ ID NOS: 23 or 25-27.

2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid
sequence of one of SEQ ID NOS: 23 or 25-27.

3. A vector comprising the nucleic acid sequence of claim 1, wherein the nucleic acid
sequence is operably linked to a promoter.

4. A host cell which contains the vector of claim 3.

5. The host cell of claim 4, wherein the host cell is selected from the group consisting of
a bacterial cell, an algal cell, a yeast cell and a plant cell.

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell.

7. An isolated and/or purified protein having lycopene ε-cyclase enzyme activity and
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23 or
25-27.

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of
SEQ ID NOS: 23 or 25-27.





WO 99/63055    PCT/US99/12121



AMENDED CLAIMS 

* [received by the International Bureau on 1 5 November 1 999 (15.11 .99); 

original claims 1,2,7 and 8 amended; remaining claims unchanged (1 page)] 
1 . An isolated and/or purified nucleic acid sequence which encodes for a protein having 

lycopene e-cyclase enzyme activity and has an amino acid sequence which is at least 85% 

identical to one of SEQ ID NOS: 23, 25 or 26. 

5 2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid 
sequence of one of SEQ ID NOS: 23, 25 or 26. 

3. A vector comprising the nucleic acid sequence of claim 1 , wherein the nucleic acid 
sequence is operably linked to a promoter. 

4. A host cell which contains the vector of claim 3. 

It) 5. The host cell of claim 4, wherein the host cell is selected from the group consisting of 
a bacterial cell, an algal cell, a yeast cell and a plant cell. 

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell. 

7. An isolated and/or purified protein having lycopene ε-cyclase enzyme activity and
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23, 25
or 26.

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of 
SEQ ID NOS: 23, 25 or 26. 
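
Claims 1 and 7 depend on a percent-identity threshold, but the claims do not state how identity is counted. The following Python sketch assumes one common convention: identity is the share of identical residues over the gap-free columns of a pairwise alignment. The function names, the accepted gap characters and the 15-column fragment (taken from the start of the Arabidopsis x lettuce alignment as printed in the figures) are illustrative assumptions, not part of the specification.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap_chars: str = ".-") -> float:
    """Percent identity over the gap-free columns of two aligned sequences.

    Assumption: both strings are rows of the same pairwise alignment, so they
    have equal length and use '.' or '-' to mark gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a in gap_chars or res_b in gap_chars:
            continue  # columns containing a gap are not counted
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 85.0) -> bool:
    """True when the percent identity is at least the threshold (85% in the claims)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Illustrative fragment only; a real comparison would use full-length sequences.
    fragment_a = "MECVGARNF.AAMAV"
    fragment_b = "MECFGARNMTATMAV"
    print(percent_identity(fragment_a, fragment_b))
    print(meets_identity_threshold(fragment_a, fragment_b))
```

Other conventions (for example, dividing by the length of the shorter sequence) give different values near the threshold, so the counting rule should be fixed before a sequence is compared against the 85% limit.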




AMENDED SHEET (ARTICLE 19) 




[Pathway diagram; legible labels: MVA, IPP, DMAPP, GPP, neurosporene, xanthophylls, abscisic acid]

FIGURE 1



[Pathway diagram of carotenoid structures; legible labels: β-zeacarotene, neurosporene, lycopene, δ-carotene, ε-carotene, α-carotene, γ-carotene, β-carotene, hydroxylase, α-cryptoxanthin, zeinoxanthin, β-cryptoxanthin, lutein, zeaxanthin]

FIGURE 2






Arabidopsis thaliana epsilon cyclase: 



[Nucleotide sequence with deduced amino acid sequence; characters illegible in the scanned source]




FIGURE 4 






1 gctctttctc ctcctcctct accg.tttce gactccgcct: cccgaaatcc 
51 ttatccggat tc t «ccgtc tc«cg.t« aaacgctttt ctgtc*g«. 
101 cgtcgtcgaa gaacggag.c agaattctcc gattg.gaac gacgagagac 
151 cggagagcac gagctcc.c. aacgct.tag acgctg.gta tctggcgttg 
201 cgtttggcgg agaaattgg. gaggaagaaa" tcggagagg* cc»c«atct:
251 aatcgctgcft atgttgtega gct«gg*.t cacttetatg gctgtratgg
301 «g*«act:a cag.ttctct tggcaaatgg aggg.gg t g. g.tctc.atg 
351 ttgg.aatg* ttggtacatt tgctctctc* gttggtgctg ctg^ggtaf 
401 ggaattctgg gcaagatggg ctcaeag.gc tctgtggcac gct*ctc*a* 
451 ggaatatgca tgagtcacat eaeaaaecaa gagaaggaee gtttgagcta 
501 aacgatgttt ttgctatagt gaacgctgg* ccagcgafctg gtctcctctc
551 ttatggattc ttcaataaag g.ctcgttce tgg*c*c*,c tttggcgccg
601 ggtfcaggca* aacgg*g«t ggaatcgcct acatgtttg* ccacgatgg* 
651 ctcgtgcaea agcgxttcee' tgtaggtccc aecgccgacg tccctt.cct 
701 ccgaaaggtc gccgccgctc accagctaca tcaeacagac aagtteaatg 
751 grgtaccata tggactgttt cttggaccca agga.ttgg* agaagttgga 
801 ggaaatgaag agttagataa ggagattagt cggagaatca aatcatacaa 
851 aaaggcctcg ggctccgggt cgagttcgag ttcttgactt taaacaagtt
901 ttaaatecca aattcttttt ttgtcttctg tcattatgat catcttaaga 
951 cggfcct 
1 ccacgggtcc gcctccccgt: trtttttccga tccgatic^cc ggtgccgagg

51 acfccagctgt. ttgttcgcgc tttctcagcc gtcaccatga ccgattctaa 

101 cgatgcrtgga atggatgctg ttcagagacg acrtcatg^tt. gaagacgaat 

151 gcatHcrtcgt: tgatgaaaat aatcgt:gtgg tgggacatga cactaagtat 

201 aactgtcatc tgatggaaaa gattgaag<rt gagaatttac ttcacagagc 

251 tttcagtgrtg tttttattca actccaagta tgaglrtgeft ctccagcaac 

301 ggrtcaaaaac aaaggttact ttcccacirtg tg-tggacaaa cacrttq^tgc 

351 agccatcctc tttaccgtga atccgagcrtt. attgaagaga atg^gcttgg

401 tgt:aagaaat gccgcacaaa ggaagcrtttH cgatgagcrtc ggtattg^ag 

451 cagaagatg^ accagtcgat gagfctcacrtc ccrftgggacg catgcrfttac 

501 aaggcacctt. crtgatgggaa atggggagag cacgaagtrtg actatctact: 

551 ettcat;cg*g cgggatgtga agcttcaacc aaacccagat gaagtggctg 

601 agatcaagta cgt:gagcagg gaagagrtta aggagctggt: gaagaaagca 

651 gatgctggcg atgaag ct g t gaaactatct ccatggttca gattggtggt: 

701 ggataatttc ttigatgaag* ggtgggatca tgttigagaaa ggaactatca 

751 ctgaagctgc agacatgaaa accattcaca agcHctrgaac tttccataag 

801 ttttggatct: tccccttccc ataataaaat taagagatga gacttttatt: 

851 gattacagac aaaactggca acaaaatcta ttcctaggat ttttttttgc 

901 tttttattrfca c«:t:tgatrtc atctctagrtt. tagtwtcat cttaaaaaaa 

951 aaaa 
1 CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC
51 GATGCCGCGA TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCCCG 
101 CGTGAACTCC GCCCAGCAGC CCAGCTGTGC ACACGCGCGA CTCCAGTTTA 
151 AGCTCAGGAG CATGCAGATG ACGCTCATGC AGCCCAGCAT CTCAGCCAAT 
201 CTGTCGCGCG CCGAGGACCG CACAGACCAC ATGAGGGGTG CAAGCACCTG 
251 GGCAGGCGGG CAGTCGCAGG ATGAGCTGAT GCTGAAGGAC GAGTGCATCT 

301 TGGTGGATGT TGAGGACAAC ATCACAGGCC ATGCCAGGAA GCTGGAGTGT
351 CACAAGTTCC TACCACATCA GCCTGCAGGC CTGCTGCACC GGGCCTTCTC
401 TGTGTTCCTG TTTGACGATC AGGGGCGACT GCTGCTGCAA CAGCGTGCAC
451 GCTCAAAAAT CACCTTCCCA AGTGTGTGGA CGAACACCTG CTGCAGCCAC 
501 CCTTTACATG GGCAGACCCC AGATGAGGTG GACCAACTAA GCCAGGTGGC 
551 CGACGGAACA GTACCTGGCG CAAAGGCTGC TGCCATCCGC AAGTTGGAGC 
601 ACGAGCTGGG GATACCAGCG CACCAGCTGC CGGCAAGCGC GTTTCGCTTC 
651 CTCACGCGTT TGCACTACTG TGCCGCGGAC GTGCAGCCAG CTGCGACACA 
701 ATCAGCGCTC TGGGGCGAGC ACGAAATGGA CTACATCTTG TTCATCCGGG 
751 CCAACGTCAC CTTGGCGCCC AACCCTGACG AGGTGGACGA AGTCAGGTAC 
801 GTGACGCAAG AGGAGCTGCG GCAGATGATG CAGCCGGACA ACGGGCTGCA 
851 ATGGTCGCCG TGGTTTCGCA TCATCGCCGC GCGCTTCCTT GAGCGTTGGT 
901 GGGCTGACCT GGACGCGGCC CTAAACACTG ACAAACACGA GGATTGGGGA 
951 ACGGTGCATC ACATCAACGA AGCGTGAAAG CAGAAGCTGC AGGATGTGAA

1001 GACACGTCAT GGGGTGGAAT TGCGTACTTG GCAGCTTCGT ATCTCCTTTT 
1051 TCTGAGACTG AACCTGCAGT CAGGTCCCAC AAGGTCAGGT AAAATGGCTC 
1101 GATAAAATGT ACCGTCACTT TTTGTCGCGT ATACTGAACT CCAAGAGGTC 
1151 AAAAAAAAAA AAAAA 
1 CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC
51 GATGCCGCGA TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCGCG
101 CGTGAACTCC GCCCAGCAGC CCAGCTGTGC ACACGCGCGA CTCCAGTTTA
151 AGCTCAGGAG CATGCAGCTG CTTTCCGAGG ACCGCACAGA CGACATGAGG 
201 GGTGCAAGCA CCTGGGCAGG CGGGCAGTCG CAGGATGAGC TGATGCTGAA 
251 GGACGAGTGC ATCTTGGTAG ATGTTGAGGA CAACATCACA GGCCATGCCA 
301 GCAAGCTGGA GTGTCACAAG TTCCTACCAC ATCAGCCTGC AGGCCTGCTG 

351 CACCGGGCCT TCTCTGTGTT CCTGTTTGAC GATCAGGGGC GACTGCTGCT
401 GCAACAGCGT GCACGCTCAA AAATCACCTT CCCAAGTGTG TGGACGAACA
451 CCTGCTGCAG CCACCCTTTA CATGGGCAGA CCCCAGATGA GGTGGACCAA
501 CTAAGCCAGG TGGCCGACGG AACAGTACCT GGCGCAAAGG CTGCTGCCAT 
551 CCGCAAGTTG GAGCACGAGC TGGGGATACC AGCGCACCAG CTGCCGGCAA 
601 GCGCGTTTCG CTTCCTCACG CGTTTGCACT ACTGTGCCGC GGACGTGCAG 
651 CCAGCTGCGA CACAATCAGC GCTCTGGGGC GAGCACGAAA TGGACTACAT 
701 CTTGTTCATC CGGGCCAACG TCACCTTGGC GCCCAACCCT GACGAGGTGG 
751 ACGAAGTCAG GTACGTGACG CAAGAGGAGC TGCGGCAGAT GATGCAGCCG
801 GACAACGGGC TTCAATGGTC GCCGTGGTTT CGCATCATCG CCGCGCGCTT
851 CCTTGAGCGT TGGTGGGCTG ACCTGGACGC GGCCCTAAAC ACTGACAAAC 
901 ACGAGGATTG GGGAACGGTG CATCACATCA ACGAAGCGTG AAGGCAGAAG 
951 CTGCAGGATG TGAAGACACG TCATGGGGTG GAATTGCGTA CTTGGCAGCT 

1001 TCGTATCTCC TTTTTCTGAG ACTGAACCTG CAGAGCTAGA GTCAATGGTG 
1051 CATCATATTC ATCGTCTCTC TTTTGTTTTA GACTAATCTG TAGCTAGAGT 
1101 CACTGATGAA TCCTTTACAA CTTTCAAAAA AAAAA 



FIGURE 10 




1 50
HPoV • MLRSLLRGLT HIPRVNSAQQ PSCAHARLQF KLRSMQMTLM QPSISANLSR

MLRSLLRGLT HIPRVNSAQQ PSCAHARLQF KLRSMQLL 
ATDP-q- MSVSSLFNLP .LIRLRSLA. LSSSFSSFRF AHRPLSSIS* PRKLPNFRAF 
C MS . SSMLNFT . ASRIVSLPL LSSPPSRVHL PLCFFSPISL TQRFSAKLTP 

/*rc**5 . TGPPPRFFP IRSPVPRTQL FVRAFSAV . *** VA- * r 

CWJ ..MTAONNSM PHGAVSSYAK LVQNQTPEDI LEEFPEIIPL QQRPn!."!tr 

51 

AEDRTDHMRG ASTWAGGQSQ DELMLKDECI LVDVEDNITG HASKLECHKF 
SEDRTDHMRG ASTWAGGQSQ DELMLKDECI LVDVEDNITG HASKLECHKF 
S..GTA.MTD TKDAGMDAVQ RRLMFEDECI LVD ETDRWG HVSKYNCHLM 
SSQATT.MGE WDAGMDAVQ RRLMFEDECI LVD ENDKWG HESKYNCHLM 
• • • . .T.MTD SNDAGMDAVQ RRLMFEDECI LVDENNRWG HDTKYNCHLM 
SSETSNDESG ETCFSGHDEE Q IKLMNENC I VLDWDDNAIG AGTKKVCHLM 

101 150 
LPHQPAGLLH RAFSVFLFDD QGRLLLQQRA RSKITFPSVW TNTCCSHPLH 
LPHQPAGLLH RAFSVFLFDD QGRLLLQQRA RSKITFPSVW TNTCCSHPLH 
ENIEAKNLLH RAFSVFLFNS KYELLLQQRS NTKVTFPLVW TNTCCSHPLY 
EKIESENLLH RAFSVFLFNS KYELLLQQRS ATKVTFPLVW TNTCCSHPLY 
EKIEAENLLH RAFSVFLFNS KYELLLQQRS KTKVTFPLVW TNTCCSHPLY 
ENIE..KGLLH RAFSVFIFNE QGELLLQQRA TEKITFPDLW TNTCCSHPLC 

151 200 

GQTPDEVDQL SQVADGTVPG AKAAAIRKLE HELG IP AHQL PA. SAFRFLT 

GQTPDEVDQL SQVADGTVPG AKAAAIRKLE HELG IP AHQL PA. SAFRFLT 

RE SELIQDNALG VRNAAQRKLL DELGIVAEDV PV.DEFTPLG 

RE SELIDENCLG VRNAAQRKLL DELGIPAEDL PV.DQFIPLS 

RE S ELI EENVLG VRNAAQRKLF DELGIVAEDV PV.DEFTPLG 

ID...DELGL KGKLDDKIKG AITAAVRKLD HELGIPEDET KTRGKFHFLN 



201 

RLHYCAADVQ PAATQSALWG EHEMDYILFI 
RLHYCAADVQ PAATQSALWG EHEMDYILFI 

RMLY KAPSDGKWG EHELDYLLFI 

RILY., KAPSDGKWG EHELDYLLFI 

RMLY KAPSDGKWG EHEVDYLLFI 



250 

. . RANVTL APNPDEVDEV 
. . RANVTL APNPDEVDEV 
. .VRDVKV QPNPDEVAEI 
. . IRDVNL DPNPDEVAEV 
VRDVKL QPNPDEVAEI 



RIHY MAPSNEPWG EHEIDYILFY KINAKENLTV NPNVNEVRDF 

251 300 

RYVTQEELRQ MMQ PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD 

RYVTQEELRQ MMQ PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD 

KYVSREELKE LVKKADAGEE GLKLSPWFRL WDNFLMKWW DHVEKGTLVE 
KYMNRDDLKE LLRKADAEEE GVKLSPWFRL WDNFLFKWW DHVEKGSLKD 
KYVSREELKE LVKKADAGDE A VKLS PWFRL WDNFLMKWW DHVEKGTITE 
KWVSPNDLKT MF ADP SYKFTPWFKI I CEN YLFNWW EQLDDLSEVE 

301 

KHEDWGTVHH INEA* 

KHEDWGTVHH INEA* 

A. IDMKTIHK L* 

A. ADMKTIHK L* 

A . ADMKTIHK L* 

NDRQ . . . IHR ML* 
CaMMMi W e«a«= cg c =at gggtgac

SI g ac t =c ggca tggatgctgt tca g c g a Cg * c tMtgtttg aogatgaatg 
101 c att « ggtg gatgagtgtg acaatgtggt „ gacatgat aoMMtaca 
151 attgC = a o« gatggagaag attgaaacag gcacagagca 
«= agcg « t ttctattca . « caaaata= gagtta=ttc ttMgcaa=9 
«i g t « g c aa == aaggtgacat « cct « agt atggaccaac acctgttgca 

301 g c=at=ca= t « acagagaa t== gagcttg « cccgaaac gcctgagaga 

351 a t g « gca = a gaggaxx3aot xxxxxxxxxx XMODOOOQO[ IODDDOaooQC 

"I XXXXXXXXXX xxxxxxxxxx xxxx ^ naoac xxxxxxxxxx 

451 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX xxxxxxxx ^ XXXXXXXXXX 

501 XXXXXXXXXX XXXXXXXXXX xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

551 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

601 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

*51 xxxxxxxxxx xxxxxxxx« xxxxxxxxxx tct^, aagggtacac 

701 tca« g aa tg = aat « ga t a t g aaaa=cat acacaag « g atatagaaac 

'51 acccctca, c= gaaaagca agcc<:aataa tt=gggttgg 9txw . etM 

801 == a tc aaftg «««t=« « aacaaet t ttaa t«=„ « tgagcatg 

a 5i « ga «=« g tc «t tgtgt gtaagatt « ggg « tcg « tcagt):gtaa 
901 ta.tg.acc. tt g a tgg «t g =a«ttcaa g «== t at= g acat^agt, 
951 atctaaaaaa 
[Amino acid sequence alignment with rows labelled Plant beta, A.t. epsilon and Consensus; the residue rows are illegible in the scanned source. Legible annotations, in order of appearance: cyanobacterial enzyme begins; possible subunit interaction domain; dinucleotide-binding signature; conserved region #1; conserved region #2; conserved region #3; conserved region #4; predicted TM helix; conserved region #5; predicted TM helix]
Adonis palaestina ε-cyclase cDNA #5 Length: 1398

1 aaaggagtgt tctattaatg ttactgtcgc attcttgcaa cacttatatt 

j sssss mtiizii v—t- sssb d 

Ill SSS22 Sisssr sssss iSSSS 



251 
301 
351 
401 
451 



ataacataca tcgatatggt tcttcttg^ ga^a|at" ^glfaoa 
SSf52 gaagc< ^ ag tagaagttct ?t?gc?tata afgajg???? 
tgtggatgaa gaggatctca tcaaagctgg tggttctcaa etr??»?^S 
JESS" * Caaacaaa * tctatlgaf? alcaggc?al gctcgJcJa? 

4 V aa 2^ CaC caatacct « tggagaatcc gtgatggact tgctSSIt 
451 aggttgtgga cctgctggtc tttcactggc tgcagaagct octaaS^ 
501 ggttgaaagt tggccttatt ggtcctgalc t?cct??Sc Ha^tta? 
551 ggtgtgtggg aagacgagtt caaagaLtt ggacttgatc 

' III g rrtt? CZ t gg aa ^acacca tcgtatatct ??ataa?gat 

ill ««iJ" tC9 Cgcatat ^ cgagttagtc gacatttjct HaSafgal 
ill S?f 93333 ggtgt ^9ga gccaggtgta tcatatclgg aScflflcI 
751 ggaaaggatc actgaagctg gtgacggcca tagccttgla cteSXlS 
ill !cS2 aeCtt * aCcccctgc ^gcttgcta ct|ttgcatc Igg^fclg" 
•90i tatlllttlt «" gWt * ^aagtaggr ggccclcgtg tttJtroS 
951 ! gggg^ggagg ttgaggtgga gaacaatcca tacgatccca 

100? a «caatggt attcatggac tacagagact atatgcaaca gaalttUac 
1001 tgctcggaag aagaacaccc aacacttctC tatgtcatgc Ictllltatl 
cn"^" "5"tg.gg aaacctgttt ggcctcaala galglcalgc 
1151 f a ™ a rf act 9 a agaga aaactgatgt cacgattgaa gac£ctgg?t 
1201 a 5c? aa ?^ a caaaa 9 ttta tgaagaggaa tggtcataca ttcctg??fg 
1201 tggttcttta ccaaacacag agcaaaagaa cctagcattt ggtgccocaa 
1251 caagcatggt gcacccagca acaggctatt cggtfgtacg J?cI««S 

5 I"?"" 33 aatat 9Cttc tgtaattgca aagatftSgl fgcaagltaa 
llll SS?? 9 *" 9t ^tttctg gacaaagtag tgcagtaalc atttcScS 
lill titbit?? cagtcttc 9g ccaaaggagc gaaaacgtca aagagcttTc 
llll gaCtagag « tattgtgcag ctagatattg aagclaccag 
llll "**" ct " ^aaccttct tccgcttgcc aacttggatg tggtggggj? 
1551 tccttgggtc ttcactatca tctttcgatc tcgtcttctt ttccltalac 
"Si tggcgccaaa cagcatgagg atgtcac?^ Jgagacl^t 
ifnf Z i ccttctggtg cagttatggt aagagcttac ctcgaaaggt 
1701 agtctcatct attattaaac tctagtgttt caccaaataa atgiggatlc 
1801 Ullatir? tatacgatca tctctatgta tatcctgtac tc?allct« 
llll '? aagCaaat gccgggtttg atatcgctgt gtcaaaccgg ccaatgatat 
1851 aaagtaaatt tattgataca aaagtagttt ttttccttal aaaaaala 

Adonis palaestina ε-cyclase #5 predicted polypeptide
TRANSLATE from: 113 to: 1702 Length: 529 amino acids

1 MELLGVRNLI SSCPVWTFGT RNLSSSKLAY NIHRYGSSCR VDFQVRADGG
51 SGSRSSVAYK EGFVDEEDFI KAGGSELLFV QMQQTKSMEK QAKLADKLPP
101 IPFGESVMDL VVIGCGPAGL SLAAEAAKLG LKVGLIGPDL PFTNNYGVWE
151 DEFKDLGLER CIEHAWKDTI VYLDNDAPVL IGRAYGRVSR HLLHEElSX
201 CVESGVSYLD SKVERITEAG DGHSLVVCEN EIFIPCRLAT VASGAASGKL
251 LEYEVGGPRV CVQTAYGVEV EVENNPYDPN LMVFMDYRDY MQqSScsS
301 ^n. LYVMP MSPTR ^EE TCLASKDAMP FDLLKRKLMS Sl^S
351 KVYEEEWSYI PVGGSLPNTE QKNLAFGAAA SMVHPATGYS VVRSLSEAPK
401 YASVIAKILK QDNSAYWSG QSSAVNISMQ AWSSLWPKER KRQRAfSfS
451 LELIVQLDIE ATRTFFRTFF RLPTWMWWGF LGSSLSSFDL VLFSMYMFVT
501 APNSMRMSLV RHLLSDPSGA VMVRAYLER*
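
The TRANSLATE header above gives a nucleotide range and a polypeptide length, and the two can be cross-checked by counting codons. The sketch below is a minimal Python illustration, assuming the range is 1-based and inclusive; the function name is hypothetical.

```python
def codons_in_range(start: int, end: int) -> int:
    """Number of whole codons in the 1-based, inclusive nucleotide range start..end."""
    span = end - start + 1
    if span <= 0:
        raise ValueError("end must not be smaller than start")
    if span % 3 != 0:
        raise ValueError("range does not contain a whole number of codons")
    return span // 3


if __name__ == "__main__":
    # Range taken from the TRANSLATE header of the Adonis palaestina clone #5.
    print(codons_in_range(113, 1702))
```

For the range 113 to 1702 this gives 530 codons, which agrees with the listed 529 amino acids plus the terminal stop shown as `*`.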
DNA sequence of potato cDNA (GenBank R27545) obtained from Nicholas J. Provart

potato.seq Length: 1379 August 2, 1996 13:06 Type: N Check: 605

1 tagcggnhnn naggatgagt tcaaagatct tggtcttcaa gcctgcattg 

51 aacatgtttg gcgggatacc attgtatatc ttgatgatga tgatcctatt 

101 cttattggcc gtgcctatgg aagagttagt cgccatttac tgcacgagga 

151 gttactcaaa aggtgtgtgg aggcaggtgt tttgtatcta aactcgaaag 

201 tggataggat tgttgaggcc acaaatggcc acagtcttgt agagtgcgag 

251 ggtgatgttg tgattccctg caggtttgtg actgttgcat cgggagcagc 

301 ctcggggaaa ttcttgcagt atgagttggg aggtcctaga gtttcrgtrc 

351 aaacagctta tggagtggaa gttgaggtcg ataacaatcc atttgacccg 

401 agcctgatgg ttttcatgga ttatagagac tatgtcagac acgacgctca 

451 atctttagaa gctaaatatc caacatttct ctatgccatg cccacgtctc 

501 caacacgagt ctttttcgag gaaacttgtt tggcttcaaa agatgcaatg
551 ccattcgatc tgttaaagaa aaaattgatg ttacgattga acaccctcgg

601 tgtaagaatt aaagaaattt atgaggagga atggtctxac ataccagttg 

651 gaggatcttt gccaaataca gaacaaaaaa cacttgcatt tggtgctgct 

701 gctagcatgg ttcatccagc cacaggttat tcagtcgtca gatcactgtc 

751 cgaagctcca aaatgcgcct tcgtgcttgc aaatatatta cgacaaaatc 

801 atagcaagaa tatgcttact agttcaagta ccccgagtat ttcaactcaa 

851 gcttggaaca ctctttggcc acaagaacga aaacgacaaa gatcgttttt 

901 cctatttgga ctggctctga tattgcagct ggatattgag gggataaggt 

951 catttttccg cgcgttcttc cgtgtgccaa aatggatgtg gcagggattt 

1001 cttggttcaa gtctttcttn agcagacctc atgttatttg ccrtctacat

1051 gtttattatt gcaccaaatg acatgagaag aggcttaatc agacatcttt 

1101 tatctgatcc tactggtgca acattgataa gaacttatct tacatrttag 

1151 agtaaatticc tcctacaata gttgttgaan nagaggcctc attacttcag 

1201 attcataaca gaaatcgcgg tctctcgagg ccctgtatat aacattttca 

1251 ctaggttaat attgcttgaa taagttgcac agtttcagtt tttgtatctg 

1301 cttctttttt grccaagatc atgtattgan ccaatttata tacattgcca 

1351 gtatatataa attttataaa aaaaaaaa 

poteps.pep Length: 378 TRANSLATE from: 14 to: 1147

1 DEFKDLGLQA CIEHVWRDTI VYLDDDDPIL IGRAYGRVSR HLLHEELLKR
51 CVEAGVLYLN SKVDRIVEAT NGHSLVECEG DVVIPCRFVT VASGAASGKF
101 LQYELGGPRV SVQTAYGVEV EVDNNPFDPS LMVFMDYRDY VRHDAQSLEA
151 KYPTFLYAMP MSPTRVFFEE TCLASKDAMP FDLLKKKLML RLNTLGVRIK
201 EIYEEEWSYI PVGGSLPNTE QKTLAFGAAA SMVHPATGYS VVRSLSEAPK
251 CAFVLANILR QNHSKNMLTS SSTPSISTQA WNTLWPQERK RQRSFFLFGL
301 ALILQLDIEG IRSFFRAFFR VPKWMWQGFL GSSLSXADLM LFAFYMFIIA
351 PNDMRRGLIR HLLSDPTGAT LIRTYLTF*

FIGURE 15A 



Chimeric lettuce/potato lycopene ε-cyclase: converts lycopene to δ-carotene; the lettuce cDNA
converts lycopene to ε-carotene, and the potato cDNA does not produce an active enzyme
(amino acids in lower case are from lettuce and those in upper case are from the potato cDNA; an AvaII site common to the
two cDNAs was used to construct the chimera)



1 mecfgarnmt atmavftcpr ftdcnirhkf sllkqrrftn lsassslrqi
51 kcsaksdrcv vdkqgisvad eedyvkaggs elffvqmqrt ksmesqskls
101 eklaqipign cildlvvigc gpaglalaae saklglnvgl igpdlpftnn
151 ygvwqdefig lglegciehs wkdtlvyldd adpirigray grvhrdllhe
201 ellrrcvesg vsylsskver iteapngysl iecegnitip crlatvasga
251 asgkfleyel gGPRVSVQTA YGVEVEVDNN PFDPSLMVFM DYRDYVRHDA
301 QSLEAKYPTF LYAMPMSPTR VFFEETCLAS KDAMPFDLLK KKLMLRLNTL
351 GVRIKEIYEE EWSYIPVGGS LPNTEQKTLA FGAAASMVHP ATGYSVVRSL
401 SEAPKCAFVL ANILRQNHSK NMLTSSSTPS ISTQAWNTLW PQERKRQRSF
451 FLFGLALILQ LDIEGIRSFF RAFFRVPKWM WQGFLGSSLS XADLMLFAFY
501 MFIIAPNDMR RGLIRHLLSD PTGATLIRTY LTF*
GAP comparison of Arabidopsis ε-cyclase x potato ε-cyclase (partial)

blosum62.cmp Gap Weight: 12 Average Match: 2.912
Length Weight: 4 Average Mismatch: -2.003
Quality: 1485 Length: 529
Ratio: 3.929 Gaps: 1
Percent Similarity: 79.893 Percent Identity: 76.139
Match display thresholds for the alignment(s): | = IDENTITY, : = 2, . = 1



151 EDEFNDLGLQKCIEHVWRETIVYLDDDKPITIGRAYGRVSRRLLHEELLR 200
, 111 11,11 ' I I I I I I : | | | | | | j | | | | | || I I I U I | | | | | | I • 
1 .DEFKDLGLQACIEHVWRDTIVYLDDDDPILIGRAYGRVSRHLLHEELLK 49

201 RCVESGVSYLSSKVDSITEASDGLRLVACDONNVIPCRLATVASGAASGK 250 

M II • I I I I . I | | | | | | . . | || | ; . | | | | I I I | I I I | I I I 

50 RCVEAGVLYLNSKVDRIVEATNGHSLVECEGDVVIPCRFVTVASGAASGK 99

251 LLQYEVTCPRVCVQTAYGVEVrVXNSPYDPDQMVFMDYRDYTNEKVRSli 300 
» I I I . I I II I I I I I I I I I I I I : I . I : | | I I I I | | | | | III 

100 FLQYELGGPRVSVQTAYGVEVEVDNNPFDPSLMVFMDYRDYVRHDAQSLE 149
301 AEYPTFXYAMPMTKSRLFFEETCIJVSKDvMPFOLI.KTKI^IJlLDTLGIRi 350 

, e „ 1 • 1 I I I I I I I I I . . | . | || | | | | | | | | | | | | | | | | | | | | , | | | - | I 

150 AKYPTFLYAMPMSPTRVFFEETCLASKDAMPFDLLKKKLMLRLNTLGVRI 199

351 LKTYEEEWSYIPVGGSLPNTEQKNIAFGAAASMVHPATGYSVVRSLSEAP 400
•• 1 1 < I I I I I I I I I I | | | | | | | I I I I I I I I I I I I | | | | | | I | | | I | I i 

200 KEIYEEEWSYIPVGGSLPNTEQKTLAFGAAASMVHPATGYSVVRSLSEAP 249

401 KYASVIAEILREETTKQI NSNISRQAWDTLWPPERKRQRAFFXFG 44S 

' 1 1:1 ' 1 1 : - ' - • I I I I I . | | | | I | | | | | Mill 

250 KCAFVLANILRQNHSKNMLTSSSTPSISTQAWNTLWPQERKRQRSFFLFG 299

446 LALIVQFDTEGIRSFFRTFFRLPKWMWQGFLGSTLTSGDLVLFALYMFvi 495 

Jii ' • 1 1 """I' I M . I I I I I I I I I I I . I . I I .| : , 

300 LALILQLDIEGIRSFFRAFFRVPKWMWQGFLGSSLSXADLMLFAFYMFII 349

4 96 S PNNLRKGLINHLI S D PTGATMIKTYLKV 524 

• I I . : I : I I I I I : I I I I I I I : I : I I I 
350 APNDMRRGLIRHLLSDPTGATLIRTYLTF 378 
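
The summary values printed at the head of this GAP comparison can be sanity-checked arithmetically. The Python sketch below assumes that Ratio is Quality divided by the length of the shorter sequence, and that the two percentages are taken over the aligned positions that contain no gap; both conventions are assumptions about the program, not statements from this document. The counts 284 and 298 do not appear in the figure: they are back-calculated here from the reported percentages over 373 gap-free positions (378 potato residues less the 5 positions opposite the single gap) and are illustrative only.

```python
def gap_ratio(quality: float, shorter_length: int) -> float:
    """Quality divided by the length of the shorter sequence (assumed definition)."""
    return quality / shorter_length


def percent(count: int, compared: int) -> float:
    """Share of compared positions, as a percentage."""
    return 100.0 * count / compared


QUALITY = 1485        # "Quality" from the figure
POTATO_LENGTH = 378   # length of the partial potato polypeptide
GAP_POSITIONS = 5     # assumed: potato residues aligned opposite the single gap
COMPARED = POTATO_LENGTH - GAP_POSITIONS

if __name__ == "__main__":
    print(round(gap_ratio(QUALITY, POTATO_LENGTH), 3))  # reported Ratio: 3.929
    print(round(percent(284, COMPARED), 3))             # reported identity: 76.139
    print(round(percent(298, COMPARED), 3))             # reported similarity: 79.893
```

That these assumed conventions reproduce the printed values to three decimals suggests, but does not prove, that they match how the figures were computed.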
Adonis palaestina Ipi1

1 attcaccttc agcagcgctg tcgtactctt tctatatctt cttccatcac 



101 ^?f» 9 ^ gg ! ^? aaC f? gCC accc 9 c <=tca acgccaacta 

1S1 _ _ _ _ 

201 tgactccaaa tacaactgtc atttgatgga aaagatagag gcagaaaatt 



tgggtgaagt cactgatgct ggaatggatg ctgttcagaa gcggctcata 
1S1 ttcgacgacg aatgtatttt ggcggatgag aatgacaagg tcgtcgggca 



251 tgcttcacag agccttcagt gttttcttgt tcaactcaaa atatgaatto 

351 
401 

451 
501 
551 

601 gatgaagttg ctgatgctaa gratgctaac c^cgagilgt tgagagagat 



,ci ctt <="cagc aacgatccgc cacaaaggta acattcccgc tcgtatgcac 

401 
451 
501 

551 tggactatct cctatttatt gtccgagatg tgaaatlcga tc^aaaccca 



— -» — — — — — - w w u v_ w u o (— 

aaacacatgt tgcagtcatc ctctctttcg tgattccgag ctcatagaag 

401 aaaattatct cggtgtacga aacgctgcac aaagaaagct tttagacgaq 

451 ctaggcattc cagctgaaga tgtcccagtt gatgaattta ctcctcttgg 

501 tcgcactctt tacaaagctc catctgacgg caaatgggga gagcacgaat 



651 
701 



actgagaaaa gctgatgctg gtgaagaggg . actcaagttg tctccttggt 
t^agatcggt tgttgataac tttttgttca agtggtggga tcatgtagag 
751 cagggtacga ttaaggaagt tgctgacatg aaaactatcc acaagttgac 
801 ttaagaggac ttctctcctc tgttctacta ttt;gttcttt gctacaataa 
851 gtgggtggtg ataagcagtt tttctgtttt ctttaattta tggccctcga 
901 atttgcctcg atgttgaact tgtaacatat ttagacaaat atgagacctt 
inn- ^taagttgaa tttgaggctg aatttatatt tttgggaaca taataatgtt 
1 uu i aa 



FIGURE 17A 



Adonis palaestina Ipi2 

1 ttttaaagct ctttcgctcc accaccatca aagccagcca aatttctcta 

101 ccttatt?^ " aaacacc ^ "ttgggctt tggcccctcc "atcggaX 

101 ccttgtttac gatacgcatc taaaccagta attctcggtt ttaattrnrr 

151 tcctaaatta ggcccctttc cggaatcccg agalttl?gt SSSS 

201 gattaatcct ttatatagta tcttctccal clccactafa alalfatca? 

251 cttcgtgttc ttctcccgct gttcatcttc agcagcgttg tcgtactct? 

301 tctatttctt cttccatcac taacagtcct cgccgaggg? tglatcggc* 

4oJ SJ3S tCa acgtC ^ acta tgggtgaagt cgct?a?glt gl^ggllg 

401 ccgtccagaa gcggcttatg ttcgacgatg aatgtatttt ggtggffgaf 

501 tlllt?t* gg tc 9 tc ^ aca tgattccaaa tacaactgtc I?t?gat?gl 

501 aaagatagag gcagaaaact tgcttcacag agccttcagt gttttcttat 

601 ac^S 333 atacga 9«g cttcttcagl alcgatctfc Kcgaagg" 

601 acattcccgc tcgtatggac aaacacctgt tgcagccatc ccctcttcco 

651 tgattccgaa ctcatagaag aaaattttct cggggtacga aacg"gcac 

701 aaaggaagct tttagacgag ctaggcattc ca|ctgaaga cgSccIg?? 

751 gatgaattca ctcctcttgg tcgcattctt tacaalgc?c 2i«£2£ 

III tt~l? qqqa ? agCaC9aaC ^gactatct tctgtttatt gtclglgalj 

901 c™^ 3 ^ ccaaaccca gatgaagttg ctgacgctaa Itacgttall 

901 cgcgaggagt tgaaagagat actgagaaaa gctgatgcag gtgaagaggg 

951 aataaagttg tctccttggt ttagattggt tgtggataac tttttStS 

1001 agtggtggga tcatgtagag gaggggal|a tLIggacgt cgccglcato 

lioi a c aa ^^ CC acaag " gac "aagagaaa gtctcltaag t?ct2cSt? 

1101 tggtttttgc ttcaataagt ggatggtgat gagcagtttt tatgcttcct 
ttaattttgg cttttcaact tgctttatgt gttgaacttg taacatattt 

aatcaaatat nana^rrp n f Z ' . 



1151 
1201 



1201 agtcaaatat gagaccttgt gagttgaatt tga^ttata ttt^ttJ 
1251 tgggaacata aaaaaaaaaa y " 
Haematococcus pluvialis Ipi1

si tiling?, ?f«n,"i s t c ""n: 9 c„ca, t =c 
s| ~g iiH sssss 

20 =tgtc,c,=g ccgagg.cc, ealagaccac a?,.,,2«g laaoS^' 

ssssss 35ss :sssg si B FF 

S£S£ ssds ~ 2 !! F 

gctcaaaaat caccttccca agtgtgtgga SaS2^« ca 9 c St<3<=ac 
cctttacatg ggcagacccc aga?glg|?g gtccaactal alt^^ 
cgacggaaca gtacctggcg caaaggctgc IJeelSS^ <? cc *<?<?tggc 
acgagctggg gatacclgcg cacclgltgl SgcaaScJc a*?r '" agC 

? HHiS SI Si i#i iili 

801 gtgacgcaag aggljclgcg ge.gaSt? iSS , * C ** "'"We 
atggtcgcc, clftlcclca £a!*,2c,I JaSSJ*** 

aiSi ...I. 1 "" ""l.2£ 



451 
501 
551 
601 
651 



851 
901 
• 951 
1001 
1051 



FIGURE 18A 



Haematococcus pluvialis Ipi2

1 tggaacctgg cccggcggca gtccgatgcc gcgatgcttc ottcni-^t 

j sssss sssss ssss: issss i i 

££££ -f-fs 

251 acgacaacat cacaggccat gccagcaagc tglagtgcca lalfetl^ 

301 ccacatcagc ctgcaggcct gctglaccgg gl?t?c?ltg 

351 tgacgaccag gggcgactgc tgctgcaall gcgtgcacg? tSJSiSI 

■ SSSES 3SS55 =~g Hi 

ill* s3=»SSS! sSbs 

g ,S2 sssss SSSS a ?!.« 

701 tggcgcccaa ccctgacgag gtggacgaag tcaggtacgt gacgcfaafo 

8oi a 9"^tgca gccggacaac gggtljSSX ggtlgccStg 

801 gtttcgcatc accgccgcgc gcttccttga gcgttggtgg JctSeeSf 

851 acgcggccct aaacactgac aaacacgagg Jclggffall ggt?l«cI1 

111 tatnZttt? C ^ t 9aaggca gaagctgcag gatHglaga Ilcglcatgg 

1001 lllllllll q ^tacttggc agcttcgcac ctcctlcclc tgagalJgal 

1051 ttlllllV** Ca 9 a 9tcaac ggtgcatcat attcaccgtc tltcttt?!? 

llll ttlllltlV C «* ta * cta W«.ctg. tgaaccc?tt acaacttSa 
Lactuca sativa Ipi1

si* c?t"ccatf tcI"ccc C tl "c«ccc?c a3CCat ^« accacgatct 
101 atgccgtctc tcgcagccgl Cag1g?tCtc I?ccacce^ " tCC « c « 
151 cgctatgggc gaccclagla tgga?g«gl cSaacar " tc ? CCCgC 

i 1 SS SSSffi! -ass ~ S3 

Si £2325 SSS22 ™ ™« 

401 acgcgrcgca gccalccact aca?agggag agtga|c??I J2?S" aC 
451 cgcccccggg gtgaggaatg ccgcallga? gaagc?«tq Itlttl^ 
501 gcatccctgg agcagacgtc ccggctgacg agtccaltcc !rr™ 9 

s -S iPSE SII feS Si 

so: ,.«Ic«SI ?S°lc«" JS«2Si 

Soi «:f;r* e * s^nn? jssss? 

si sssss tttzziit iiziuz: - c ! c «" 
Lactuca sativa Ipi2

1 tattcgcttc aaaatctctt ccattaactg ctcaaatctc caccttcgcc
51 ggtcttaatc tccgccggcg cactttcacc accataaccg ccgccatggg
101 tgacgattcc ggcatggacg ctgtccagag acgtctcatg tttgatgatg
151 aatgcatttt ggttgatgaa aatgacaatg ttctcgggca tgataccaaa
201 cacaattgtc acttgatgga gaagattgag aaagataatt tgctccatag
251 agcattcagt gtatttttat tcaattcaaa atacgaatta ctccttcagc
301 aaaggtcaga aaccaaggtg acatttcctt tggtatggac aaacacctgt
351 tgcagccatc cactatacag agaatcggag ttaattcccg aaaatgccct
401 tggggtcaga aatgctgcac agaggaagct tctagatgaa ctcggtatcc
451 ctgctgaaga tgttccagtt gatgagttca caactttagg tcgcatgttg
501 tacaaggctc catctgatgg aaaatggggt gaacatgaag ttgattacct
551 actcttcctc gtgcgtgacg ttgccgtgaa cccaaaccct gatgaggtgg
601 cggacattag atacgtgaac caagaagagt taaaagagtt actaaggaag
651 gcggatgcgg gtgaggaggg tttgaaattg tccccatggt ttaggctagt
701 ggtggacaac ttcttgttca aatggtggga tcatgtccaa aaggggacac
751 tcaatgaagc aattgacatg aaaaccattc ataagttgat atgaaa T aat:g~
801 gttaatattt atggtggtgg tttggagcta ataatttgtg tgttcaagtc
851 tcggtccttc ttcttttaac gttttttttt tttcttttat tgggagtgtt
901 tattgtgtac ttgtaacgta ggccctttgg ttacgcttta agagtttaat
951 aaagaaccac cgttaattta aaaaaaaaaa aaaaaaaa
Chlamydomonas reinhardtii Ipi1

(Note: the isomerase cDNA probably ends at « h^o- uni * 
1 ggcacgagct cgagtttgtt ttaccatgac atcgggaatt tqoaaactto 

i iiisiiiiiiii 

• 301 faaflrr^ 9 ctctc 9 tc <? c gcgtgtcagc tccgccctgt gglcggglgfr 
3sJ c^ggctcaaa gcgtcgcggt gcgaatggcg HtcIg?cS 

• a || 32352 32S3 "SSS ?c a ~ 

SSSS ~£ I™ J 

III " gcagcgc * cagccagcaa ggtgacglcc ccggglgfg? gjafcalca? 

' III cllZlZlln? Caccc 9«^ cgggccaggc gclggalglg |t£S££ 

701 SISac?S a 9<=«cgggc caggtgccgg gcatcaaggl gglggcgg?g 

7 = 1 ^" a ?" 9C a 9 ca cgagct ggggataccg ccggagcagg ctcccocct? 

Ill ™" CtCC ttcc "acgc gtctgcacta ctgcglcgll galalJgccf 

851 f?catacaa^ ggCggcggag ^ggggcgagc acgalgtgga l t a?5rg«g 

901 " c< ? ca< 3 ca S cc cgtcagcccg cagcccaacc cagalglga? 

951' c ^ ctac 9 t 9 a =gctgccgga gcttcagtcc aefatfglgg 

1001 L^'f ca 9<=tggagc ccctggttcc gcatcctggc calacfgllf 

1051 9C cc 9 cc ^gt:g gggcgacctg aagcggcgct ggcgcclggg 

1051 cggcagccga ctgtcggact ggggcaccat ccaccgcgtc aLIgaaol! 

1151 Caggggcg *9 -flcgggggat gaatgfglat gtgaltgl^ 

1201 «f^ 9Cg gc ^ 1: 9ggatg aggtctgaag acagggggaa LLgg?ggg 

llll flff!r£ 9a9 C 9^9tgtgta cgtgagcgac aaagccggga ggcg?121gl 

JsoJ I^ 9a ! gggta cat 9t:gcgcg cggagggtcg gtgggtcggt Ugllgcglg 

1351 ItaZltallt ttgtgtgtgt gcggctgcgc ggglatglgg glacclgfg? 

1351 acggaggaga aggcacacgc aggtggcgcg gaggtgtgtc aggggclato 

iisJ aaaI?? CCtC actcct ^ Cc gtgcccagtg gtSegtggg d^ggcl 

lill tSSSS ™£ tWC ggc ^ cact ^ cgcgctgfll taag?c?f" 

1551 of a ««" 9 a S9tggggcg aggtggctgt gggcggcggg cgcagtggca 

16M 1*™° 9 ^ t 9 t ^gagc ggtggagctc tggccgtgcl ggccltjlgg 

1651 till™?** ^atgacgt tgtgcttggc cgctgtaatg cjggagaaf? 

1701 ^caggccgc gagaagcggg cggtggcagg aggccgcagg clgLJcacl 

111! ■ allll^l^ ^ cc< 3ccz g caggcgcggc gccgggcgU ccfgagt^t 

llll llltlttT ^tagtggcgg ccacaggagg cgcaggagll agcagla|ga 

llll KSKSS lllllltlll aLT aaccc aagg « gcgc ~ 
Tagetes erecta Ipi1

5 V «»^f aacaa ctcaaacctc ccccgtcgcc cttactccgc cacgggtgac 

51 gactccggca cggatgctgt ccagcgacgt ctcacgtttg acgataaa^c ' 

III It^llll-t gaCgagtgtg a «"gtggr gggacltga? acSaltacI 

151 atwgtcactt gacggagaag attgaaacag gtaaaatgcc gcacagaoca 

251 CCc "" caa ttcaaaatac gagrtacltc ?tcag?a1cg 

251 gtctgcaacc aaggcgacat ttccttcagc atggaccaac acctgttaca 

301 gccatccacc ctacagagaa cccgagcttg ttcccgaaaa cgcc«t?ga 

351 gtaagaaatg ctgcacagag gaagctgttg gatgalctcg g?ae«c?S 

401 cgaagacgct cccgttgacc agtttactcc tttaggtcgl acgctltalS 

451 aggctccacc tgatggaaag tggggagaac atgalcttga ctlcctac" 
W*egttgc Cgtaaacccg aacccaga?g aagtjglgga 

60 1 gtganccang aagagttaaa ggagctrgcta aggaiagclg 

601 atgcggggga ggagggtttg aagctgtctc catggttcag gttagrggt? 

651. .gataacttct tgtccaagtg gtgggatcat gtgcaaaagl gtacactlac 

701 cgaagcaatt gatatgaaaa ccatacacaa gclgatatl? aaaca1a«c 

751 tcaaccgaaa agttcaagcc taataattcg ggttgggtcl ggtctaccS 

801 caattgtttt- tttcttttaa gaagttttaa. t£tcS?tt| Igcatottga' ' 

III llUllltll < Cg ^ tgtaa gattctgggt ttcgtttca? ttgtaJSX 



951 SSI"** tgg " Cgcaa "tcaagttc ctatcgacat gtagrgatct 

FIGURE 21 A 



Oryza sativa Ipi1

1 cctccctttg ccrcgcgcag aggcggccgc gccttctccg ccgcgaggat
51 ggccggcgcc gccgccgccg tggaggacgc cgggatggac gaggtccaga
101 agcggctcat gttcgacgac gaatgcattt tggtggatga acaagacaat
151 gttgttggcc atgaatcaaa atataactgc catctgatgg aaaaaatcga
201 atctgaaaat ctacttcata gggctttcag tgtattcctg ttcaactcaa
251 aatatgaact cctactccag caacgatctg caacaaaggt tacatttcct
301 ctagtttgga ccaacacttg ctgcagccat cctctgtacc gtgagtctga
351 gcttacacag gaaaactacc ttggtgttag aaatgctgct cagaggaagc
401 tcttggatga gctgggcatc ccagctgaag atgtgccagt tgaccaattc
451 acccctcttg gtcggatgct ttacaaggcc ccatctgatg gaaaatgggg
501 tgaacacgag cttgactacc tgctgttcat cgtccgcgac gtgaaggtag
551 tcccgaaccc ggacgaagtg gccgatgtga aatacgtgag ccgtgagcag
601 ctgaaggagc tcatccgcaa agcggacgcc ggagaggaag gcctgaagct
651 gtctccctgg ttccggctgg ttgttgacaa cttcctcatg ggctggtggg
701 atcacgtcga gaaaggcacc ctcaacgagg ccgtggacat ggagaccatc
751 cacaagctga agtaaggact gcgatgttgt ggctggaaag aatgatcctg
801 aagactctgt tcttgtgctg ctgcatatta ctcttaccag ggaagttgca
851 gaagtcagaa gaagcttttg tatgtttctg ggtttggagc ttggaagtgt
901 tgggctctgc tgactgagag attcccttat agagtgtcta tgttaattta
951 gcaaacttct atattataca tgattagtta attgttcggt gtctgaataa
1001 agaacaatag catgttccac gtttatttgc t
Comparison using GAP program of the Genetics Computer Group

Gap Weight: 50 Average Match: 10.000
Length Weight: 3 Average Mismatch: 0.000
Quality: 17392 Length: 1904
Ratio: 9.411 Gaps: 3
Percent Similarity: 95.331 Percent Identity: 95.331
Match display thresholds for the alignment(s): | = IDENTITY
FIGURE 23 (cont.)
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GAP program of Genetics Computer Group 
blosum62.cmp 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 2728 Length: 530 

Ratio: 5.147 Gaps: 0

Percent Similarity: 99.623 Percent Identity: 99.057
Match display thresholds for the alignment(s): | = IDENTITY  : = 2

Adonis palaestina e-cyclase #3 x Adonis palaestina e-cyclase #5

  1 MELLGVRNLISSCPVWTFGTRNLSSSKLAYNIHRYGSSCRVDFQVRADGG 50
  1 MELLGVRNLISSCPVWTFGTRNLSSSKLAYNIHRYGSSCRVDFQVRADGG 50

 51 SGSRTSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100
 51 SGSRSSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100

101 IPFGESVT4DLVVIGCGPAGLSLAAEAAKLGLKVGLIGPDLPFTNNYGVWE 150
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F'IGURE , 28 

GAP of: Arabidopsis epsilon cyclase to Lettuce epsilon cyclase 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 1837 Length: 534

Ratio: 3.499 Gaps: 3

Percent Similarity: 76.381 Percent Identity: 69.905

Match display thresholds for the alignment(s):
| = IDENTITY
: = 2
. = 1

Arabidopsis x Lettuce 

  1 MECVGARNF.AAMAVSTFPSW...SCRRKFPVVKRYSYRNIRFGLCSVRA 46
  1 MECFGARNMTATMAVFTCPRFTDCNIRHKFSLLKQRRFTNLSASSSLRQI 50

 47 SGGGSSGSESCVAVREDFADEEDFVKAGGSEILFVQMQQNKDMDEQSKLV 96
 51 KCSAKSDRCVVDKQGISVADEEDYVKAGGSELFFVQMQRTKSMESQSKLS 100

 97 DKLPPISIGDGALDHVVIGCGPAGLALAAESAKLGLKVGLIGPDLPFTNN 146
101 EKLAQIPIGNCILDLVVIGCGPAGLALAAESAKLGLNVGLIGPDLPFTNN 150

147 YGVWEDEFNDLGLQKCIEHVWRETIVYLDDDKPITIGRAYGRVSRRLLHE 196
151 YGVWQDEFIGLGLEGCIEHSWKDTLVYLDDADPIRIGRAYGRVHRDLLHE 200

197 ELLRRCVESGVSYLSSKVDSITEASDGLRLVACDDNNVIPCRLATVASGA 246
201 ELLRRCVESGVSYLSSKV^RITEAPNGYSLIECEGNITIPCRLATVASGA 250

247 ASGKLLQYEVGGPRVCVQTAYGVEVEVENSPYDPDQMVFMDYRDYTNEKV 296
251 ASGKFLEYELGGPRVCVQTAYGIEVEVENNPYDPDLMVFMDYRDFSKHKP 300

297 RSLEAEYPTFLYAMPMTKSRLFFEETCLASKDVMPFDLLKTKLMLRLDTL 346
301 ESLEAKYPTFLYVMAMSPTKIFFEETCLASREAMPFNLLKSKLMSRLKAM 350

347 GIRILKTYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSL 396
351 GIRITRTXEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSL 400

397 SEAPKYAS VIAEILREETTKQINS NI S RQAWDTLWP PER KRQRAF 441
401 SEAPNYAAVIAKILRQDQSKEMISLGKYTNISKQAWETLWPLERKRQ^ 450

442 FLFGLALIVQFDTEGIRSFFRTFFRLPKWMWQGFLGSTLTSGDLVLFALY 491
451 FLFGLSHIVLXDLEGTRTFFRTFFRLPKWMWWGFLGSSLSSTDLIIFALY 500

492 MFVISPNNLRKGLINHLISDPTGATMIKTYLKV* 525
501 MFVIAPHSLRMELVRHLLSDPTGATMVKAYLTI* 534 
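
The summary statistics printed with each GAP comparison can be cross-checked from the aligned sequences themselves. The following is a minimal sketch, not the GCG implementation: it assumes that percent identity is computed over columns in which neither sequence has a gap, and that the reported Ratio is the Quality score divided by the length of the shorter sequence. Both assumptions appear consistent with the figures above (for example 1837 / 525 for the Arabidopsis x Lettuce comparison). The function names are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = ".") -> float:
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(seq_a, seq_b):
        if res_a == gap or res_b == gap:
            continue  # gapped columns are left out of the denominator
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * identical / compared


def quality_ratio(quality: float, shorter_length: int) -> float:
    """Quality divided by the length of the shorter sequence (assumed definition)."""
    return quality / shorter_length


# Consistency checks against the values printed in the comparison headers.
print(round(quality_ratio(1837, 525), 3))  # Arabidopsis x Lettuce
print(round(quality_ratio(2728, 530), 3))  # Adonis #3 x Adonis #5
```

Applying `percent_identity` to the full concatenated alignment rows, rather than to a single block, is what would be compared against the reported Percent Identity.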
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SEQUENCE LISTING 

<110> CUNNINGHAM JR., FRANCIS X. 
SUN, ZAIREN 

<120> GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM AND
METHODS OF USE THEREOF

<130> 8172-9023

<140> NOT YET ASSIGNED
<141> 1999-06-02

<150> 09/088,724
<151> 1998-06-02

<150> 09/088,725
<151> 1998-06-02

<160> 61

<170> PatentIn Ver. 2.0

<210> 1
<211> 1860
<212> DNA
<213> Arabidopsis thaliana

<220>
<221> CDS
<222> (109)..(1680)
<400> 1

ACAAAAGGAA ATAATTAGAT TCCTCTTTCT GCTTGCTATA CCTTGATAGA ACAATATAAC 60 

AATGGTGTAA GTCTTCTCGC TGTATTCGAA ATTATTTGGA GGAGGAAA ATG GAG TGT 117 

Met Glu Cys 
1 

GTT GGG GCT AGG* AAT TTC GCA GCA ATG GCG GTT TCA ACA TTT CCG TCA 165 
Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr Phe Pro Ser 
5 10 15 

TGG AGT TGT CGA AGG AAA TTT CCA GTG GTT AAG AGA TAC AGC TAT AGG 213 
Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr Ser Tyr Arg 
20 25 30 35 

AAT ATT CGT TTC GGT TTG TGT AGT GTC AGA GCT AGC GGC GGC GGA AGT 2 61 

Asn lie Arg Phe-Gly Leu Cys Ser -Val Arg -Ala Ser Gly Gly Gly Ser 
4 0 4 5 50 

TCC GGT AGT GAG AGT TGT GTA GCG GTG AGA GAA GAT TTC GCT GAC GAA 30 9 

Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe Ala Asp Glu 
55 60 65 

GAA GAT TTT GTG AAA GCT GGT GGT TCT GAG ATT CTA TTT GTT CAA ATG 357 
Glu Asp Phe Val Lys Ala Gly Gly Ser Glu lie Leu Phe Val Gin Met 
70 ^ " 75 80 

CAG CAG AAC AAA GAT ATG GAT GAA CAG TCT AAG CTT GTT GAT AAG TTG 4 05 



\ 
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Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val Asp Lys Leu 

♦85 90 95 

CCT CCT ATA TCA ATT GGT GAT GGT GCT TTG GAT CAT GTG GTT ATT GGT 4 53 

Pro Pro lie Ser lie Gly Asp Gly Ala Leu Asp His Val Val lie Gly 
100 105 110 115 

TGT GGT CCT GCT GGT TTA GCC TTG GCT GCA GAA TCA GCT AAG CTT GGA 501 

Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly 
120 125 - 130 

TTA AAA GTT GGA CTC ATT GGT CCA GAT CTT CCT TTT ACT AAC AAT TAC 54 9 

Leu Lys Val Gly Leu lie Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr 
135 140 . 145 

GGT GTT TGG GAA GAT GAA TTC AAT GAT CTT GGG CTG CAA AAA TGT ATT 5 97 

Gly Val TrD Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin Lys Cys lie 
150 155 160 

GAG CAT GTT TGG AGA GAG ACT ATT GTG TAT CTG GAT GAT GAC AAG CCT 64 5 

Glu His Val Trp Arg Glu Thr lie Val Tyr Leu Asp Asp Asp Lys Pro 

165* 170 175 

ATT ACC ATT GGC CGT GCT TAT GGA AGA GTT AGT CGA CGT TTG CTC CAT 693 

lie Thr lie Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His 
180 185 190 195 

GAG GAG CTT TTG AGG AGG TGT GTC GAG TCA GGT GTC • TCG TAC CTT AGC 741 

Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser 
200 205 210 

TCG AAA GTT GAC AGC ATA ACA GAA GCT TCT GAT GGC CTT AGA CTT GTT 78 9 

Ser Lys Val Asp Ser lie Thr Glu Ala Ser Asp Gly Leu Arg Leu Val 
215 220 225 

. GCT TGT GAC GAC AAT AAC GTC ATT CCC TGC AGG CTT GCC ACT GTT GCT 8 37 

Ala Cys Asp Asp Asn Asn Val lie Pro Cys Arg Leu Ala Thr Val Ala 
230 235 240 

TCT GGA GCA GCT TCG GGA AAG CTC TTG CAA TAC GAA GTT GGT GGA CCT 885 

Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val Gly Gly Pro 

245 250 255 

AGA GTC TGT GTG CAA ACT GCA TAC GGC GTG GAG GTT GAG GTG GAA AAT 933 

Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn 
260 265 270 275 

AGT CCA TAT GAT CCA GAT CAA ATG GTT TTC ATG GAT TAC AGA GAT TAT 981 

Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr Arg Asp Tyr 
280 285 290 

ACT AAC GAG AAA GTT CGG AGC TTA GAA GCT GAG TAT CCA ACG TTT CTG 1029 

Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu 
295 300 305 

TAC GCC ATG CCT ATG ACA AAG TCA AGA CTC TTC TTC GAG GAG ACA TGT 1077 

Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys 
310 315 ^ 320 

TTG GCC TCA AAA GAT GTC ATG CCC TTT GAT TTG CTA AAA ACG AAG CTC 1125 

Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu 



2~ 
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325 " 330 ''V ■ 335 

ATG TTA AGA TTA GAT ACA CTC GGA ATT CGA ATT CTA AAG ACT TAC GAA 117 3 

Met Leu Arg Leu Asp Thr Leu Gly lie Arg He Leu Lys Thr Tyr Glu 
340 345 35.0 355 

GAG GAG TGG TCC TAT ATC CCA GTT GGT GGT TCC TTG CCA AAC ACC . GAA 1221 
Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn Thr Glu 
360 365 370 

CAA AAG AAT CTC GCC TTT GGT GCT GCC GCT AGC ATG GTA CAT CCC GCA 12 69 

Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala 
375 380 385 

ACA GGC TAT TCA GTT GTG AGA TCT TTG TCT GAA GCT CCA AAA TAT GCA 1317 
Thr Gly Tyr Ser J/al Val Arg Ser Leu Ser Glu Ala Pro -Lys Tyr Ala 
390 395 400 

TCA GTC ATC GCA GAG ATA ■ CTA AGA GAA GAG ACT. ACC ,AAA CAG ATC AAC ■ .. -1365 
Ser Val He Ala Glu He Leu Arg Glu Glu Thr Thr Lys Gin He Asn 
405 410 415 

AGT AAT ATT TCA AGA CAA GCT TGG GAT ACT TTA. TGG CCA CCA GAA AGG 1,413 
Ser Asn He Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg 
420 425 430 435 

AAA AGA CAG AGA GCA/.TTC TTT CTC TTT GGT. CTT GCA CTC ATA GTT CAA 1461 
Lys Arg Gin Arg 'Ala Phe Phe Leu Phe Gly Leu Ala Leu He Val Gin 
' 440 445 450 

TTC GAT ACC GAA GGC ATT AGA AGC TTC TTC CGT ACT TTC TTC CGC CTT 1509 
Phe Asp Thr Glu Gly He Arg Ser Phe Phe Aro; Thr Phe Phe Arg Leu 
455 460 465 

CCA AAA TGG ATG TGG CAA GGG TTT CTA GGA TCA ACA TTA ACA TCA GGA 15 57 

Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly 
470 ' 475 480 

GAT CTC GTT CTC TTT GCT TTA TAC ATG TTC GTC ATT TCA CCA AAC AAT 1605 
Asp Leu Val Leu ,E>he Ala Leu Tyr Met Phe Val He Ser Pro Asn Asn 
485 ■ ' 490 495 

TTG AGA AAA GGT CTC ATC AAT CAT CTC ATC TCT GAT CCA ACC GGA GCA 1653 
Leu Arg Lys Gly Leu lie Asn His Leu He Ser Asp Pro Thr Gly Ala 
500 " 505 510 515 

ACC ATG ATA AAA ACC TAT CTC AAA GTA TGATTTACTT ATCAACTCTT 1700 
Thr Met He Lys Thr Tyr Leu Lys Val 
520 

AGGTTTGTGT ATATATATGT TGATTTATCT GAATAATCGA TCAAAGAATG GTATGTGGGT 17 60 

TACTAGGAAG TTGGAAACAA ACATGTATAG AATCTAAGGA GTGATCGAAA TGGAGATGGA 1820 

AACGAAAAGA AAAAAATCAG TCTTTGTTTT GTGGTTAGTG 18 60 
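
The CDS feature of SEQ ID NO:1 spans positions 109 to 1680, which is 1572 bases or 524 codons, matching the 524-residue length given for SEQ ID NO:2. As a sketch of how such a feature can be checked, the snippet below translates a 1-based inclusive interval with the standard genetic code. The helper name `translate_cds` and the `FRAGMENT` constant are illustrative; the fragment is the first 117 bases of SEQ ID NO:1 as printed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive interval [start, end] of a DNA listing."""
    # Keep letters only, so spaces and position numbers in the listing are ignored.
    cleaned = "".join(ch for ch in dna.upper() if ch.isalpha())
    cds = cleaned[start - 1:end]
    if len(cds) != end - start + 1:
        raise ValueError("interval extends past the end of the sequence")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing ambiguous bases (e.g. N) are reported as X.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 117 bases of SEQ ID NO:1; the CDS begins at position 109.
FRAGMENT = """
ACAAAAGGAA ATAATTAGAT TCCTCTTTCT GCTTGCTATA CCTTGATAGA ACAATATAAC
AATGGTGTAA GTCTTCTCGC TGTATTCGAA ATTATTTGGA GGAGGAAA ATG GAG TGT
"""

print(translate_cds(FRAGMENT, 109, 117))  # MEC (Met Glu Cys)
```

Passing the full 1860-base sequence with the interval (109, 1680) would be the way to compare the result against SEQ ID NO:2.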



<210> 2
<211> 524
<212> PRT
<213> Arabidopsis thaliana
<400> 2

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr
  1               5                  10                  15

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr
             20                  25                  30

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly
         35                  40                  45

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe
     50                  55                  60

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe
 65                  70                  75                  80

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val
                 85                  90                  95

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val
            100                 105                 110

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala
        115                 120                 125

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr
    130                 135                 140

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln
145                 150                 155                 160

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp
                165                 170                 175

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg
            180                 185                 190

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser
        195                 200                 205

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu
    210                 215                 220

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala
225                 230                 235                 240

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val
                245                 250                 255

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu
            260                 265                 270

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr
        275                 280                 285

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro
    290                 295                 300

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu
305                 310                 315                 320

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys
                325                 330                 335

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys
            340                 345                 350

Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro
        355                 360                 365

Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val
    370                 375                 380

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro
385                 390                 395                 400

Lys Tyr Ala Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys
                405                 410                 415

Gln Ile Asn Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro
            420                 425                 430

Pro Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu
        435                 440                 445

Ile Val Gln Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe
    450                 455                 460

Phe Arg Leu Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu
465                 470                 475                 480

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser
                485                 490                 495

Pro Asn Asn Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro
            500                 505                 510

Thr Gly Ala Thr Met Ile Lys Thr Tyr Leu Lys Val
515 520 
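
Protein entries in this listing use three-letter residue codes with position numbers on alternate lines, whereas the GAP alignments in the figures use one-letter codes. A small conversion sketch follows, so that a listed sequence can be compared with an alignment row. The function name `to_one_letter` is illustrative, and the input is assumed to contain only standard residue codes and numbers.

```python
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing: str) -> str:
    """Convert a three-letter protein listing to a one-letter string.

    Purely numeric tokens (the position markers) are skipped; any other
    unrecognised token raises an error so that damaged text is noticed.
    """
    residues = []
    for token in listing.split():
        if token.isdigit():
            continue  # position number, not a residue
        code = THREE_TO_ONE.get(token.capitalize())
        if code is None:
            raise ValueError(f"unrecognised residue code: {token!r}")
        residues.append(code)
    return "".join(residues)


first_row = """
Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr
  1               5                  10                  15
"""
print(to_one_letter(first_row))  # MECVGARNFAAMAVST
```

Raising on unknown tokens is a deliberate choice here: misread codes would otherwise be dropped silently and shift every later position.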

<210> 3
<211> 956
<212> DNA
<213> Arabidopsis thaliana
<400> 3

GCTCTTTCTC CTCCTCCTCT ACCGATTTCC GACTCCGCCT CCCGAAATCC TTATCCGGAT 60
TCTCTCCGTC TCTTCGATTT AAACGCTTTT CTGTCTGTTA CGTCGTCGAA GAACGGAGAC 120
AGAATTCTCC GATTGAGAAC GATGAGAGAC CGGAGAGCAC GAGCTCCACA AACGCTATAG 180
ACGCTGAGTA TCTGGCGTTG CGTTTGGCGG AGAAATTGGA GAGGAAGAAA TCGGAGAGGT 240
CCACTTATCT AATCGCTGCT ATGTTGTCGA GCTTTGGTAT CACTTCTATG GCTGTTATGG 300
CTGTTTACTA CAGATTCTCT TGGCAAATGG AGGGAGGTGA GATCTCAATG TTGGAAATGT 360
TTGGTACATT TGCTCTCTCT GTTGGTGCTG CTGTTGGTAT GGAATTCTGG GCAAGATGGG 420
CTCATAGAGC TCTGTGGCAC GCTTCTCTAT GGAATATGCA TGAGTCACAT CACAAACCAA 480
GAGAAGGACC GTTTGAGCTA AACGATGTTT TTGCTATAGT GAACGCTGGT CCAGCGATTG 540
GTCTCCTCTC TTATGGATTC TTCAATAAAG GACTCGTTCC TGGTCTCTGC TTTGGCGCCG 600
GGTTAGGCAT AACGGTGTTT GGAATCGCCT ACATGTTTGT CCACGATGGT CTCGTGCACA 660
AGCGTTTCCC TGTAGGTCCC ATCGCCGACG TCCCTTACCT CCGAAAGGTC GCCGCCGCTC 720
ACCAGCTACA TCACACAGAC AAGTTCAATG GTGTACCATA TGGACTGTTT CTTGGACCCA 780
AGGAATTGGA AGAAGTTGGA GGAAATGAAG AGTTAGATAA GGAGATTAGT CGGAGAATCA 840
AATCATACAA AAAGGCCTCG GGCTCCGGGT CGAGTTCGAG TTCTTGACTT TAAACAAGTT 900
TTAAATCCCA AATTCTTTTT TTGTCTTCTG TCATTATGAT CATCTTAAGA CGGTCT 956



<210> 4 
<211> 294
<212> PRT

<213> Arabidopsis thaliana
<400> 4

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser 
15 10 15 

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
20 25 = 30 

Tyr Val Val Glu Glu Arg Arg Gin Asn Ser Pro He Glu Asn Asp Glu 
35 4 0 4 5 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala He Asp Ala Glu Tyr Leu 
50 55 60 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
,65 70 75 80 

Thr Tyr Leu He Ala Ala Met Leu Ser Ser Phe Gly He Thr Ser Met 
85 90 " 95 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 
100 105 HO 

- V 

Glu lie Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
115 120 125 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
130 135 140 

Trp His Ala Ser Leu Trp Met Asn His Glu Ser His His Lys Pro Arg 
1^5 150 155 160 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala lie Val Asn Ala Gly 
165 170 175 

Pro Ala lie Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
180 185 190 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly lie Thr Val Phe Gly lie 
195 200 205 
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Ala Tyr Met Phe Val His Asp Gly Leu* Val His Lys Arg Phe Pro Val 
210 • 215 * 220 

Gly Pro lie" Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
225 230 235 240 

Gin Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
•245 250 255 

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
260 265 270 

Lys Glu lie Ser Arg Arg lie Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
275 280 , 285 

Gly Ser Ser«Ser,Ser Ser 
290 

<210> 5

<211> 162
<212> PRT

<213> Alcaligenes sp.

<400> 5
Met Thr Gin Phe Leu lie Val Val Ala Thr Val Leu Val Met Glu Leu 
1 5 10 15 • 

Thr Ala Tyr Ser Val His Arg Trp lie Met His Gly Pro Leu Gly Trp 
20 25 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
35 40 45 

Asn Asp Leu Tyr Gly Val Val Phe Ala Val Leu Ala Thr lie Leu Phe 
5 0 55 60 

Thr Val Gly Ala Tyr Trp Trp Pro Val Leu Trp Trp lie Ala Leu Gly 
65 ^ 70 75 80 

Met Thr Val Tyr, Gly Leu lie Tyr Phe lie Leu His Asp Gly Leu Val 
• • 85 90 95 

His Gin Arg Trp Pro Phe Arg Tyr lie Pro Arg Arg Gly Tyr Phe Arg 
100 105 110 

Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 * 120 125 

His Cys Val Ser Phe Gly Phe lie Tyr Ala Pro Pro Val Asp Lys Leu 

- 130 - * 135 14 0 . 

Lys Gin Asp Leu Lys Arg Ser Gly Val Leu Arg Pro Gin Asp Glu Arg 
145 1 150 155 160 

Pro Ser 



<210> 6 

<211> 175 

<212> PRT 

<213> Erwinia herbicola 
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<400> 6 

Met Leu Asn ( Ser Leu lie Val He Leu Ser Val He Ala Met Glu -Gly 
1*5 10 15 

He Ala Ala Phe Thr His Arg Tyr lie Met His Gly Trp Gly Trp Arg 
20 25 ' 30 

Trp His Glu Ser His His Thr Pro Arg Lys Gly Val Phe Glu Leu Asn 
35 40 45 

Asp Leu Phe Ala Val Val Phe Ala Gly Val Ala He Ala Leu He Ala 
50 55 60 

Val Gly Thr Ala Gly Val Trp Pro Leu Gin Trp He Gly Cys Gly Met 
65 70 75 80 

Thr Val Tyr Gly Leu Leu Tyr Phe Leu Val His Asp Gly Leu Val His 

8 5 90 ' 95 

Gin Arg Trp Pro Phe His Trp lie Pro Arg Arg Gly Tyr Leu Lys Arg 
100 105 110 

Leu Tyr Val Ala His Arg Leu His His Ala Val Arg Gly Arg Glu Gly 
115 120 125 

Cys Val Ser Phe Gly Phe He Tyr Ala Arg Lys Pro Ala Asp Leu Gin 
130 135 140 

Ala He Leu Arg Glu Arg His Gly Arg Pro Pro Lys Arg Asp Ala Ala 
145 150 155 * 160 

Lys Asp Arg Pro Asp Ala Ala Ser Pro Ser Ser Ser Ser Pro Glu 
165 170 175 

<210> 7 
<211> 175 
<212> PRT 

<213> Erwinia uredovora 
<400> 7 

Met Leu Trp He Trp Asn Ala Leu He Val Phe Val Thr Val He Gly 
15 10 15 

» 

Met Glu Val He Ala Ala Leu Ala His Lys Tyr He Met His Gly Trp 
20 25 "* 30 

Gly Trp Gly Trp His Leu Ser His His Glu Pro Arg Lys Gly Ala Phe 
35 40 45 

Glu Val Asn Asp Leu Tyr Ala Val Val Phe Ala Ala Leu Ser He Leu 
50 55 60 

Leu He Tyr Leu Gly Ser Thr Gly Met Trp Pro Leu Gin Trp lie Gly 
65 70 75 80 

Ala Gly Met Thr Ala Tyr Gly Leu Leu Tyr Phe Met Val His Asp Gly 

85 90 95 

Leu Val His Gin Arg Trp Pro Phe Arg Tyr lie Pro Arg Lys Gly Tyr 
100 105 110 
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Leu Lys Arg Leu Tyr Met Ala His Arg* Met His Has Ala Val Arg Gly 
115 120 . 125 

Lys Glu Gly' Cys Val Ser Phe Gly Phe Leu Tyr Ala Pro Pro Leu Ser 
130 135 140 

Lys Leu Gin Ala Thr Leu Arg Glu Arg His Gly Ala Arg Ala Gly Ala 
145 , 150 155 160 

Ala Arg Asp Ala Gin Gly Gly Glu Asp Glu Pro Ala Ser Gly Lys 
165 170 175 

<210> 8 
<211> 162 
<212> PRT 

<213> Agrobacterium aurantiacum
<400> 8 

Met Thr Asn Phe Leu lie Val Val Ala Thr Val Leu Val Met Glu Leu. 
1 5 10 15 

Thr Ala Tyr Ser Val His Arg Trp lie Met His GTy Pro Leu Gly Trp 

20 25 • '30 ^ - t 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
35 -40 45 

Asn Asp Leu Tyr Gly Leu Val Phe Ala' Val lie Ala Thr Val Leu Phe 
50 ' 55 "60 

Thr Val Gly Trp lie Trp Ala Pro Val Leu Trp* Trp lie Ala Leu Gly 
65 " 70 75 80 

Met Thr Val Tyr Gly Leu lie Tyr Phe Val Leu His Asp Gly Leu Val 
85 90 95 

His Trp Arg Trp Pro Phe Arg Tyr lie Pro Arg Lys Gly Tyr Ala Arg 
100 105 110 

Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 , 120 125 

His Cys Val Ser Phe Gly Phe lie Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gin Asp Leu Lys Met Ser Gly Val Leu Arg Ala Glu Ala Gin Glu 
145 4 150 155 160 

Arg Thr 

<210> 9 
<211> 954 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 9 

CCACGGGTCC GCCTCCCCGT TTTTTTCCGA TCCGATCTCC GGTGCCGAGG ACTCAGCTGT 60 
TTGTTCGCGC TTTCTCAGCC GTCACCATGA CCGATTCTAA CGATGCTGGA ATGGATGCTG 120 
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TTCAGAGACG ACTCATGTTT GAAGACGAAT GCATTCTCGT TGATGAAAAT AATCGTGTGG , 180 

TGGGACATGA CACTAAGTAT AACTGTCATC TGATGGAAAA GATTGAAGCT GAGAATTTAC 240

TTCACAGAGC TTTCAGTGTG TTTTTATTCA ACTCCAAGTA TGAGTTGCTT CTCCAGCAAC 300 

GGTCAAAAAC AAAGGTTACT TTCCCACTTG TGTGGACAAA CACTTGTTGC AGCCATCCTC 360 

TTTACCGTGA ATCCGAGCTT ATTGAAGAGA ATGTGCTTGG TGTAAGAAAT GCCGCACAAA 420

GGAAGCTTTT CGATGAGCTC GGTATTGTAG CAGAAGATGT ACCAGTCGAT GAGTTCACTC 480

CCTTGGGACG CATGCTTTAC AAGGCACCTT CTGATGGGAA ATGGGGAGAG CACGAAGTTG 540

ACTATCTACT CTTCATCGTG CGGGATGTGA AGCTTCAACC AAACCCAGAT GAAGTGGCTG 600 

AGATCAAGTA CGTGAGCAGG GAAGAGCTTA AGGAGCTGGT GAAGAAAGCA GATGCTGGCG 660 

ATGAAGCTGT GAAACTATCT CCATGGTTCA GATTGGTGGT GGATAATTTC TTGATGAAGT 720 

GGTGGGATCA TGTTGAGAAA GGAACTATCA CTGAAGCTGC AGACATGAAA ACCATTCACA 780

AGCTCTGAAC TTTCCATAAG TTTTGGATCT TCCCCTTCCC ATAATAAAAT TAAGAGATGA 840

GACTTTTATT GATTACAGAC AAAACTGGCA ACAAAATCTA TTCCTAGGAT TTTTTTTTGC 900 

TTTTTATTTA CTTTTGATTC ATCTCTAGTT TAGTTTTCAT CTTAAAAAAA AAAA 954 

<210> 10
<211> 996
<212> DNA
<213> Arabidopsis thaliana
<400> 10

CACCAATGTC TGTTTCTTCT TTATTTAATC TCCCATTGAT TCGCCTCAGA TCTCTCGCTC 60
TTTCGTCTTC TTTTTCTTCT TTCCGATTTG CCCATCGTCC TCTGTCATCG ATTTCACCGA 120
GAAAGTTACC GAATTTTCGT GCTTTCTCTG GTACCGCTAT GACAGATACT AAAGATGCTG 180
GTATGGATGC TGTTCAGAGA CGTCTCATGT TTGAGGATGA ATGCATTCTT GTTGATGAAA 240
CTGATCGTGT TGTGGGGCAT GTCAGCAAGT ATAATTGTCA TCTGATGGAA AATATTGAAG 300
CCAAGAATTT GCTGCACAGG GCTTTTAGTG TATTTTTATT CAACTCGAAG TATGAGTTGC 360
TTCTCCAGCA AAGGTCAAAC ACAAAGGTTA CGTTCCCTCT AGTGTGGACT AACACTTGTT 420
GCAGCCATCC TCTTTACCGT GAATCAGAGC TTATCCAGGA CAATGCACTA GGTGTGAGGA 480
ATGCTGCACA AAGAAAGCTT CTCGATGAGC TTGGTATTGT AGCTGAAGAT GTACCAGTCG 540
ATGAGTTCAC TCCCTTGGGA CGTATGCTGT ACAAGGCTCC TTCTGATGGC AAATGGGGAG 600
AGCATGAACT TGATTACTTG CTCTTCATCG TGCGAGACGT GAAGGTTCAA CCAAACCCAG 660
ATGAAGTAGC TGAGATCAAG TATGTGAGCC GGGAAGAGCT GAAGGAGCTG GTGAAGAAAG 720
CAGATGCAGG TGAGGAAGGT TTGAAACTGT CACCATGGTT CAGATTGGTG GTGGACAATT 780
TCTTGATGAA GTGGTGGGAT CATGTTGAGA AAGGAACTTT GGTTGAAGCT ATAGACATGA 840
AAACCATCCA CAAACTCTGA ACATCTTTTT TTAAAGTTTT TAAATCAATC AACTTTCTCT 900
TCATCATTTT TATCTTTTCG ATGATAATAA TTTGGGATAT GTGAGACACT TACAAAACTT 960
CCAAGCACCT CAGGCAATAA TAAAGTTTGC GGCCGC 996



<210> 11
<211> 1165
<212> DNA
<213> Haematococcus pluvialis
<400> 11

CTCGGTAGCT GGGCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC GATGCCGCGA 60
TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCCCG CGTGAACTCC GCCCAGCAGC 120
CCAGCTGTGC ACACGCGCGA CTCCAGTTTA AGCTCAGGAG CATGCAGATG ACGCTCATGC 180
AGCCCAGCAT CTCAGCCAAT CTGTCGCGCG CCGAGGACCG CACAGACCAC ATGAGGGGTG 240
CAAGCACCTG GGCAGGCGGG CAGTCGCAGG ATGAGCTGAT GCTGAAGGAC GAGTGCATCT 300
TGGTGGATGT TGAGGACAAC ATCACAGGCC ATGCCAGCAA GCTGGAGTGT CACAAGTTCC 360
TACCACATCA GCCTGCAGGC CTGCTGCACC GGGCCTTCTC TGTGTTCCTG TTTGACGATC 420
AGGGGCGACT GCTGCTGCAA CAGCGTGCAC GCTCAAAAAT CACCTTCCCA AGTGTGTGGA 480
CGAACACCTG CTGCAGCCAC CCTTTACATG GGCAGACCCC AGATGAGGTG GACCAACTAA 540
GCCAGGTGGC CGACGGAACA GTAGCTGGCG CAAAGGCTGC TGCCATCCGC AAGTTGGAGC 600
ACGAGCTGGG GATACCAGCG CACCAGCTGC CGGCAAGCGC GTTTCGCTTC CTCACGCGTT 660
TGCACTACTG TGCCGCGGAC GTGCAGCCAG CTGCGACACA ATCAGCGCTC TGGGGCGAGC 720
ACGAAATGGA CTACATCTTG TTCATCCGGG CCAACGTCAC CTTGGCGCCC AACCCTGACG 780
AGGTGGACGA AGTCAGGTAC GTGACGCAAG AGGAGCTGCG GCAGATGATG CAGCCGGACA 840
ACGGGCTGCA ATGGTCGCCG TGGTTTCGCA TCATCGCCGC GCGCTTCCTT GAGCGTTGGT 900
GGGCTGACCT GGACGCGGCC CTAAACACTG ACAAACACGA GGATTGGGGA ACGGTGCATC 960
ACATCAACGA AGCGTGAAAG CAGAAGCTGC AGGATGTGAA GACACGTCAT GGGGTGGAAT 1020
TGCGTACTTG GCAGCTTCGT ATCTCCTTTT TCTGAGACTG AACCTGCAGT CAGGTCCCAC 1080
AAGGTCAGGT AAAATGGCTC GATAAAATGT ACCGTCACTT TTTGTCGCGT ATACTGAACT 1140
CCAAGAGGTC AAAAAAAAAA AAAAA 1165



<210> 12
<211> 1135
<212> DNA
<213> Haematococcus pluvialis
<400> 12

CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC GATGCCGCGA 60
TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCGCG CGTGAACTCC GCCCAGCAGC 120
CCAGCTGTGC ACACGCGCGA CTCCAGTTTA AGCTCAGGAG CATGCAGCTG CTTTCCGAGG 180
ACCGCACAGA CCACATGAGG GGTGCAAGCA CCTGGGCAGG CGGGCAGTCG CAGGATGAGC 240
TGATGCTGAA GGACGAGTGC ATCTTGGTAG ATGTTGAGGA CAACATCACA GGCCATGCCA 300
GCAAGCTGGA GTGTCACAAG TTCCTACCAC ATCAGCCTGC AGGCCTGCTG CACCGGGCCT 360
TCTCTGTGTT CCTGTTTGAC GATCAGGGGC GACTGCTGCT GCAACAGCGT GCACGCTCAA 420
AAATCACCTT CCCAAGTGTG TGGACGAACA CCTGCTGCAG CCACCCTTTA CATGGGCAGA 480
CCCCAGATGA GGTGGACCAA CTAAGCCAGG TGGCCGACGG AACAGTACCT GGCGCAAAGG 540
CTGCTGCCAT CCGCAAGTTG GAGCACGAGC TGGGGATACC AGCGCACCAG CTGCCGGCAA 600
GCGCGTTTCG CTTCCTCACG CGTTTGCACT ACTGTGCCGC GGACGTGCAG CCAGCTGCGA 660
CACAATCAGC GCTCTGGGGC GAGCACGAAA TGGACTACAT CTTGTTCATC CGGGCCAACG 720
TCACCTTGGC GCCCAACCCT GACGAGGTGG ACGAAGTCAG GTACGTGACG CAAGAGGAGC 780
TGCGGCAGAT GATGCAGCCG GACAACGGGC TTCAATGGTC GCCGTGGTTT CGCATCATCG 840
CCGCGCGCTT CCTTGAGCGT TGGTGGGCTG ACCTGGACGC GGCCCTAAAC ACTGACAAAC 900
ACGAGGATTG GGGAACGGTG CATCACATCA ACGAAGCGTG AAGGCAGAAG CTGCAGGATG 960
TGAAGACACG TCATGGGGTG GAATTGCGTA CTTGGCAGCT TCGTATCTCC TTTTTCTGAG 1020
ACTGAACCTG CAGAGCTAGA GTCAATGGTG CATCATATTC ATCGTCTCTC TTTTGTTTTA 1080
GACTAATCTG TAGCTAGAGT CACTGATGAA TCCTTTACAA CTTTCAAAAA AAAAA 1135


<210> 13
<211> 960
<212> DNA
<213> Tagetes erecta
<400> 13

CCAAAAACAA CTCAAATCTC CTCCGTCGCT CTTACTCCGC CATGGGTGAC GACTCCGGCA 60
TGGATGCTGT TCAGCGACGT CTCATGTTTG ACGATGAATG CATTTTGGTG GATGAGTGTG 120
ACAATGTGGT GGGACATGAT ACCAAATACA ATTGTCACTT GATGGAGAAG ATTGAAACAG 180
GTAAAATGCT GCACAGAGCA TTCAGCGTTT TTCTATTCAA TTCAAAATAC GAGTTACTTC 240
TTCAGCAACG GTCTGCAACC AAGGTGACAT TTCCTTTAGT ATGGACCAAC ACCTGTTGCA 300
GCCATCCACT CTACAGAGAA TCCGAGCTTG TTCCCGAAAC GCCTGAGAGA ATGCTGCACA 360
GAGGANNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 660
NNNNNNNNNN NNNNNNNNNN TCATGTGCAA AAGGGTACAC TCACTGAATG CAATTTGATA 720
TGAAAACCAT ACACAAGCTG ATATAGAAAC ACACCCTCAA CCGAAAAGCA AGCCTAATAA 780
TTCGGGTTGG GTCGGGTCTA CCATCAATTG TTTTTTTCTT TTAACAACTT TTAATCTCTA 840
TTTGAGCATG TTGATTCTTG TCTTTTGTGT GTAAGATTTT GGGTTTCGTT TCAGTTGTAA 900

TAATGAACCA TTGATGGTTT GCAATTTCAA GTTCCTATCG ACATGTAGTG ATCTAAAAAA 960 
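
SEQ ID NO:13 contains a long internal run of undetermined bases (N); by the position markers in the listing it appears to extend from position 366 to position 680. The sketch below locates such runs so that the determined 5' and 3' portions can be handled separately. The function name `unknown_runs` is illustrative, and the input is assumed to be a listing whose only letters are base symbols.

```python
import re


def unknown_runs(listing: str) -> list[tuple[int, int]]:
    """Return 1-based, inclusive (start, end) intervals of consecutive N bases."""
    # Drop spaces, line breaks and position numbers; keep only the base letters.
    bases = re.sub(r"[^A-Za-z]", "", listing).upper()
    return [(m.start() + 1, m.end()) for m in re.finditer(r"N+", bases)]


example = "ACGTNNNN NNACGT 16"
print(unknown_runs(example))  # [(5, 10)]
```

Because position numbers are stripped before counting, the reported coordinates depend only on the bases themselves, not on how the listing is wrapped.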

<210> 14

<211> 305

<212> PRT

<213> Haematococcus pluvialis

<400> 14

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His lie Pro Arg Val Asn. 
1. 5 10 15 

Ser Ala Gin Gin Pro ^er Cys Ala His Ala. Arg Leu Gin Phe Lys Leu 
20/ * 25 30 

Arg Ser Met Gin Met Thr Leu Met Gin Pro Ser lie Ser Ala Asn Leu 
35 40 45 

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp 
50 55 60 

Ala Gly Gly Gin Ser Gin Asp Glu Leu Met. Leu Lys Asp Glu Cys lie 
65 70 75 80 

Leu Val Asp Val Glu Asp Asn lie Thr Gly His Ala Ser Lys Leu Glu 
.,,85 90 95 

Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu Leu His Arg Ala 
100 105 110 

Phe Ser Val Phe Leu Phe Asp Asp Gin Gly Arg Leu Leu Leu Gin Gin 
115 120 125 

Arg Ala Arg Ser Lys He Thr Phe Pro Ser Val Trp Thr Asn Thr Cys 
130 ~ 135 140 

Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu Val Asp Gin Leu 
145 150 ^ 155 160 

Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala He 
165 170 175 

Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His Gin Leu Pro Ala 
180 185 190 

Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val 
195 200 205 
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Gin Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu His Glu Met Asp 
210 r 215 220 

Tyr lie Leu Phe lie Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp 
225 230 235 240 

Glu Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu Leu Arg Gin Met 
245 250 255 

Met Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp. Phe Arg lie lie 
260 265 ' 270 

Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu 
275 280 , 285 

Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His His He Asn Glu 
290 295 300 

Ala 
305 

<210> 15
<211> 293 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 15 

Met Leu Arg 'Ser Leu Leu Arg Gly Leu Thr His He .Pro Arg Val Asn 
15 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 30 

Arg Ser Met Gin Leu Leu Ser Glu Asp Arg Thr Asp His Met Arg Gly 
35 40 " 45 

Ala Ser Thr Trp Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys 
50 55 60 

Asp Glu Cys He Leu Val Asp Val Glu Asp Asn He Thr Gly His Ala 
65 70 75 80 

Ser Lys Leu *Glu Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu 
85 90 95 

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gin Gly Arg Leu 
100 105 110 

Leu Leu Gin Gin Arg Ala Arg Ser Lys He Thr Phe Pro Ser Val Trp 
115 120 125 

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu 
130 135 140 

Val Asp Gin Leu Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys 
145 150 * 155 160 

Ala Ala Ala He Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His 
165 170 175 

Gin Leu Pro Ala Ser Ala Phe "Arg Phe Leu Thr Arg Leu His Tyr Cys 



tM 
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1 5B0 185-' ,# : ' * • 190 

Ala Ala Asd Val Gin Pro Ala Ala Thr Gin Ser Ala Leu Tro Gly Glu 

195* 200 205 . 

His Glu Met Asp Tyr lie Leu Phe lie Arq Ala Asn Val Thr Leu Ala 
210 215 220 

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu 
225 230 235 240 

Leu Arg Gin Met Met Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp 
245 250 255 

Phe Arg lie lie Ala Aia Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu 
260 s 265 . . 270 

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His 

275 , ... 280 . , 285 ... 

His lie Asn Glu Ala 

290 . ' 

<210> 16
<211> 284 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 16 

Met Ser Val Ser Ser Leu Phe Asn Leu Pro Leu lie Arg Leu Arg Ser 
1 5 10 15 

Leu Ala Leu Ser Ser Ser Phe Ser Ser Phe Arg Phe Ala His Arg Pro 
20 25 30 

Leu Ser Ser lie Ser Pro Arg Lys Leu Pro Asn Phe Arg Ala Phe Ser 
35 40 45 

Gly Thr Ala Met Thr Asp Thr Lys Asp Ala Gly Met Asp Ala Val Gin 
50 * 55 * 60 

Arg Arg Leu Met Phe Glu Asp Glu Cys lie Leu Val Asp Glu Thr Asp 
65 70 75 80 

Arg Val Val Gly His Val Ser Lys Tyr Asn Cys His Leu Met Glu Asn 
85 90 95 

lie Glu Ala Lys Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe 
100 105 110 



Asn Ser Lys Tyr Glu Leu Leu Leu Gin Gin Arg Ser Asn Thr Lys Val 
115 120 125 

Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr 
130 135 140 

Arg Glu Ser Glu Leu lie Gin Asp Asn Ala Leu Gly Val Arg Asn Ala 
145 150 155 160 

Ala Gin Arg Lys Leu Leu Asp Glu Leu Gly lie Val Ala Glu Asp Val 
165 170 175 
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Pro Val Asp Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro 
♦ ,180 185 190 

Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe lie 
195 200 205 

Val Arg Asp Val Lys Val Gin Pro Asn Pro Asp Glu Val Ala Glu lie 
210 215 220 

Lys Tyr Val Ser Arg Glu Glu Leu Lys Glu Leu Val. Lys Lys Ala Asp 
225 230 235 240 

Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val 
245 250 255 

Asp Asn Phe Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr Leu 
260 265 270 

Val Glu Ala lie Asp Met Lys Thr lie His Lys Leu 
275 280 

<210> 17 
<211> 287 
<212> PRT 

<213> Clarkia breweri 
<400> 17 

Met Ser Ser Ser Met Leu Asn Phe Thr Ala Ser Arg Ile Val Ser Leu
1 5 10 15

Pro Leu Leu Ser Ser Pro Pro Ser Arg Val His Leu Pro Leu Cys Phe
20 25 30

Phe Ser Pro Ile Ser Leu Thr Gln Arg Phe Ser Ala Lys Leu Thr Phe
35 40 45

Ser Ser Gln Ala Thr Thr Met Gly Glu Val Val Asp Ala Gly Met Asp
50 55 60

Ala Val Gln Arg Arg Leu Met Phe Glu Asp Glu Cys Ile Leu Val Asp
65 70 75 80

Glu Asn Asp Lys Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu
85 90 95

Met Glu Lys Ile Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val
100 105 110

Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala
115 120 125

Thr Lys Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His
130 135 140

Pro Leu Tyr Arg Glu Ser Glu Leu Ile Asp Glu Asn Cys Leu Gly Val
145 150 155 160

Arg Asn Ala Ala Gln Arg Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala
165 170 175

Glu Asp Leu Pro Val Asp Gln Phe Ile Pro Leu Ser Arg Ile Leu Tyr
180 185 190

Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu
195 200 205

Leu Phe Ile Ile Arg Asp Val Asn Leu Asp Pro Asn Pro Asp Glu Val
210 215 220

Ala Glu Val Lys Tyr Met Asn Arg Asp Asp Leu Lys Glu Leu Leu Arg
225 230 235 240

Lys Ala Asp Ala Glu Glu Glu Gly Val Lys Leu Ser Pro Trp Phe Arg
245 250 255

Leu Val Val Asp Asn Phe Leu Phe Lys Trp Trp Asp His Val Glu Lys
260 265 270

Gly Ser Leu Lys Asp Ala Ala Asp Met Lys Thr Ile His Lys Leu
275 280 285

<210> 18
<211> 261
<212> PRT
<213> Arabidopsis thaliana
<400> 18

Thr Gly Pro Pro Pro Arg Phe Phe Pro Ile Arg Ser Pro Val Pro Arg
1 5 10 15

Thr Gln Leu Phe Val Arg Ala Phe Ser Ala Val Thr Met Thr Asp Ser
20 25 30

Asn Asp Ala Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe Glu Asp
35 40 45

Glu Cys Ile Leu Val Asp Glu Asn Asn Arg Val Val Gly His Asp Thr
50 55 60

Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala Glu Asn Leu Leu
65 70 75 80

His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu
85 90 95

Leu Gln Gln Arg Ser Lys Thr Lys Val Thr Phe Pro Leu Val Trp Thr
100 105 110

Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu Ile Glu
115 120 125

Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu Phe Asp
130 135 140

Glu Leu Gly Ile Val Ala Glu Asp Val Pro Val Asp Glu Phe Thr Pro
145 150 155 160

Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu
165 170 175

His Glu Val Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Lys Leu Gln
180 185 190

Pro Asn Pro Asp Glu Val Ala Glu Ile Lys Tyr Val Ser Arg Glu Glu
195 200 205

Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp Glu Ala Val Lys
210 215 220

Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Met Lys Trp
225 230 235 240

Trp Asp His Val Glu Lys Gly Thr Ile Thr Glu Ala Ala Asp Met Lys
245 250 255

Thr Ile His Lys Leu
260

<210> 19
<211> 288
<212> PRT
<213> Saccharomyces cerevisiae
<400> 19

Met Thr Ala Asp Asn Asn Ser Met Pro His Gly Ala Val Ser Ser Tyr
1 5 10 15

Ala Lys Leu Val Gln Asn Gln Thr Pro Glu Asp Ile Leu Glu Glu Phe
20 25 30

Pro Glu Ile Ile Pro Leu Gln Gln Arg Pro Asn Thr Arg Ser Ser Glu
35 40 45

Thr Ser Asn Asp Glu Ser Gly Glu Thr Cys Phe Ser Gly His Asp Glu
50 55 60

Glu Gln Ile Lys Leu Met Asn Glu Asn Cys Ile Val Leu Asp Trp Asp
65 70 75 80

Asp Asn Ala Ile Gly Ala Gly Thr Lys Lys Val Cys His Leu Met Glu
85 90 95

Asn Ile Glu Lys Gly Leu Leu His Arg Ala Phe Ser Val Phe Ile Phe
100 105 110

Asn Glu Gln Gly Glu Leu Leu Leu Gln Gln Arg Ala Thr Glu Lys Ile
115 120 125

Thr Phe Pro Asp Leu Trp Thr Asn Thr Cys Cys Ser His Pro Leu Cys
130 135 140

Ile Asp Asp Glu Leu Gly Leu Lys Gly Lys Leu Asp Asp Lys Ile Lys
145 150 155 160

Gly Ala Ile Thr Ala Ala Val Arg Lys Leu Asp His Glu Leu Gly Ile
165 170 175

Pro Glu Asp Glu Thr Lys Thr Arg Gly Lys Phe His Phe Leu Asn Arg
180 185 190

Ile His Tyr Met Ala Pro Ser Asn Glu Pro Trp Gly Glu His Glu Ile
195 200 205

Asp Tyr Ile Leu Phe Tyr Lys Ile Asn Ala Lys Glu Asn Leu Thr Val
210 215 220

Asn Pro Asn Val Asn Glu Val Arg Asp Phe Lys Trp Val Ser Pro Asn
225 230 235 240

Asp Leu Lys Thr Met Phe Ala Asp Pro Ser Tyr Lys Phe Thr Pro Trp
245 250 255

Phe Lys Ile Ile Cys Glu Asn Tyr Leu Phe Asn Trp Trp Glu Gln Leu
260 265 270

Asp Asp Leu Ser Glu Val Glu Asn Asp Arg Gln Ile His Arg Met Leu
275 280 285

<210> 20
<211> 456
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Consensus
sequence of four plant B-cyclases

<400> 20

Met Asp Thr Leu Leu Lys Thr Pro Asn Leu Glu Phe Leu Pro His Gly
1 5 10 15

Phe Val Lys Ser Phe Ser Lys Phe Gly Lys Cys Glu Gly Val Cys Val
20 25 30

Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys Glu Asn
35 40 45

Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val Val Asp
50 55 60

Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val Ala Gln Gln
65 70 75 80

Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp Pro Pro Lys Leu
85 90 95

Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met
100 105 110

Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly Ala Val Tyr Ile
115 120 125

Asp Asp Thr Lys Asp Leu Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln
130 135 140

Leu Lys Ser Lys Met Met Gln Lys Cys Ile Asn Gly Val Lys Phe His
145 150 155 160

Gln Ala Lys Val Ile Lys Val Ile His Glu Glu Lys Ser Met Leu Ile
165 170 175

Cys Asn Asp Gly Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly
180 185 190

Phe Ser Arg Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln
195 200 205

Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Lys
210 215 220

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Asn Glu Leu Lys
225 230 235 240

Glu Arg Asn Ser Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser
245 250 255

Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly Leu
260 265 270

Arg Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu His Leu Gly
275 280 285

Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys Val Ile Pro Met
290 295 300

Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly Ile Gly Gly
305 310 315 320

Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr
325 330 335

Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile Tyr Leu Gly Ser
340 345 350

Glu Ser Ser Gly Glu Leu Ser Ala Glu Val Trp Lys Asp Leu Trp Pro
355 360 365

Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe Cys Phe Gly Met Asp Ile
370 375 380

Leu Leu Lys Leu Asp Leu Pro Ala Thr Arg Arg Phe Phe Asp Ala Phe
385 390 395 400

Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu
405 410 415

Phe Leu Pro Glu Leu Ile Val Phe Gly Leu Ser Leu Phe Ser His Ala
420 425 430

Ser Asn Thr Ser Arg Glu Ile Met Thr Lys Gly Thr Pro Leu Val Met
435 440 445

Ile Asn Asn Leu Leu Gln Asp Glu
450 455

<210> 21 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 21 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr
1 5 10 15

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr
20 25 30

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly
35 40 45

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe
50 55 60

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe
65 70 75 80

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val
85 90 95

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val
100 105 110

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala
115 120 125

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr
130 135 140

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln
145 150 155 160

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp
165 170 175

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg
180 185 190

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser
195 200 205

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu
210 215 220

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala
225 230 235 240

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val
245 250 255

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu
260 265 270

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr
275 280 285

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro
290 295 300

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu
305 310 315 320

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys
325 330 335

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys
340 345 350

Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro
355 360 365

Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val
370 375 380

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro
385 390 395 400

Lys Tyr Ala Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys
405 410 415

Gln Ile Asn Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro
420 425 430

Pro Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu
435 440 445

Ile Val Gln Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe
450 455 460

Phe Arg Leu Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu
465 470 475 480

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser
485 490 495

Pro Asn Asn Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro
500 505 510

Thr Gly Ala Thr Met Ile Lys Thr Tyr Leu Lys Val
515 520

<210> 22 
<211> 1898 
<212> DNA 
<213> Adonis palaestina

<400> 22

AAAGGAGTGT TCTATTAATG TTACTGTCGC ATTCTTGCAA CACTTATATT CAAACTCCAT 60

TTTCTTCTTT TCTCTTCAAA ACAACAAACT AATGTGAGCA GAGTATCTGG CTATGGAACT 120

ACTTGGTGTT CGCAACCTCA TCTCTTCTTG CCCTGTGTGG ACTTTTGGAA CAAGAAACCT 180

TAGTAGTTCA AAACTAGCTT ATAACATACA TCGATATGGT TCTTCTTGTA GAGTAGATTT 240

TCAAGTGAGA GCTGATGGTG GAAGCGGGAG TAGAAGTTCT GTTGCTTATA AAGAGGGTTT 300

TGTGGATGAA GAGGATTTTA TCAAAGCTGG TGGTTCTGAG CTTTTGTTTG TCCAAATGCA 360

GCAAACAAAG TCTATGGAGA AACAGGCCAA GCTCGCCGAT AAGTTGCCAC CAATACCTTT 420

TGGAGAATCC GTGATGGACT TGGTTGTAAT AGGTTGTGGA CCTGCTGGTC TTTCACTGGC 480

TGCAGAAGCT GCTAAGCTAG GGTTGAAAGT TGGCCTTATT GGTCCTGATC TTCCTTTTAC 540

AAATAATTAT GGTGTGTGGG AAGACGAGTT CAAAGATCTT GGACTTGAAC GTTGTATCGA 600

GCATGCTTGG AAGGACACCA TCGTATATCT TGATAATGAT GCTCCTGTCC TTATTGGTCG 660

TGCATATGGA CGAGTTAGTC GACATTTGCT ACATGAGGAG TTGCTGAAAA GGTGTGTGGA 720
GTCAGGTGTA TCATATCTTG ATTCTAAAGT GGAAAGGATC ACTGAAGCTG GTGATGGCCA 780
TAGCCTTGTA GTTTGTGAAA ATGAGATCTT TATCCCTTGC AGGCTTGCTA CTGTTGCATC 840

TGGAGCAGCT TCAGGGAAAC TTTTGGAGTA TGAAGTAGGT GGCCCTCGTG TTTGTGTCCA 900 
AACCGCTTAT GGGGTGGAGG TTGAGGTGGA GAACAATCCA TACGATCCCA ACTTAATGGT 960 

ATTCATGGAC TACAGAGACT ATATGCAACA GAAATTACAG TGCTCGGAAG AAGAATATCC 1020 

AACATTTCTC TATGTCATGC CCATGTCGCC AACAAGACTT TTTTTTGAGG AAACCTGTTT 1080 

GGCCTCAAAA GATGCCATGC CATTCGATCT ACTGAAGAGA AAACTGATGT CACGATTGAA 1140 

GACTCTGGGT ATCCAAGTTA CAAAAGTTTA TGAAGAGGAA TGGTCATATA TTCCTGTTGG 1200 

TGGTTCTTTA CCAAACACAG AGCAAAAGAA CCTAGCATTT GGTGCTGCAG CAAGCATGGT 1260

GCATCCAGCA ACAGGCTATT CGGTTGTACG GTCACTGTCA GAAGCTCCAA AATATGCTTC 1320

TGTAATTGCA AAGATTTTGA AGCAAGATAA CTCTGCGTAT GTGGTTTCTG GACAAAGTAG 1380

TGCAGTAAAC ATTTCAATGC AAGCATGGAG CAGTCTTTGG CCAAAGGAGC GAAAACGTCA 1440

AAGAGCATTC TTTCTTTTTG GATTAGAGCT TATTGTGCAG CTAGATATTG AAGCAACCAG 1500 

AACATTCTTT AGAACCTTCT TCCGCTTGCC AACTTGGATG TGGTGGGGTT TCCTTGGGTC 1560

TTCACTATCA TCTTTCGATC TCGTCTTGTT TTCCATGTAC ATGTTTGTTT TGGCGCCAAA 1620 

CAGCATGAGG ATGTCACTTG TGAGACATTT GCTTTCAGAT CCTTCTGGTG CAGTTATGGT 1680 

AAGAGCTTAC CTCGAAAGGT AGTCTCATCT ATTATTAAAC TCTAGTGTTT CACCAAATAA 1740

ATGAGGATCC TTCGAATGTG TATATGATCA TCTCTATGTA TATCCTGTAC TCTAATGTCA 1800

TAAAGTAAAT GCCGGGTTTG ATATTGTTGT GTCAAACCGG CCAATGATAT AAAGTAAATT 1860

TATTGATACA AAAGTAGTTT TTTTCCTTAA AAAAAAAA 1898

<210> 23
<211> 529
<212> PRT
<213> Adonis palaestina
<400> 23

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu
515 520 525

Arg

<210> 24

<211> 1370 
<212> DNA 
<213> Potato 

<400> 24 

TAGCGGAGGA TGAGTTCAAA GATCTTGGTC TTCAAGCCTG CATTGAACAT GTTTGGCTGG 60 

GATACCATTG TATATCTTGA TGATGATGAT CCTATTCTTA TTGGCCGTGC CTATGGAAGA 120

GTTAGTCGCC ATTTACTGCA CGAGGAGTTA CTCAAAAGGT GTGTGGAGGC AGGTGTTTTG 180

TATCTAAACT CGAAAGTGGA TAGGATTGTT GAGGCCACAA ATGGCCACAG TCTTGTAGAG 240

TGCGAGGGTG ATGTTGTGAT TCCCTGCAGG TTTGTGACTG TTGCATCGGG AGCAGCCTCG 300 

GGGAAATTCT TGCAGTATGA GTTGGGAGGT CCTAGAGTTT CTGTTCAAAC AGCTTATGGA 360 

GTGGAAGTTG AGGTCGATAA CAATCCATTT GACCCGAGCC TGATGGTTTT CATGGATTAT 420

AGAGACTATG TCAGACACGA CGCTCAATCT TTAGAAGCTA AATATCCAAC ATTTCTCTAT 480

GCCATGCCCA TGTCTCCAAC ACGAGTCTTT TTCGAGGAAA CTTGTTTGGC TTCAAAAGAT 540

GCAATGCCAT TCGATCTGTT AAAGAAAAAA TTGATGTTAC GATTGAACAC CCTCGGTGTA 600 

AGAATTAAAG AAATTTATGA GGAGGAATGG TCTTACATAC CAGTTGGAGG ATCTTTGCCA 660 

AATACAGAAC AAAAAACACT TGCATTTGGT GCTGCTGCTA GCATGGTTCA TCCAGCCACA 720 

GGTTATTCAG TCGTCAGATC ACTGTCTGAA GCTCCAAAAT GCGCCTTCGT GCTTGCAAAT 780

ATATTACGAC AAAATCATAG CAAGAATATG CTTACTAGTT CAAGTACCCC GAGTATTTCA 840

ACTCAAGCTT GGAACACTCT TTGGCCACAA GAACGAAAAC GACAAAGATC GTTTTTCCTA 900

TTTGGACTGG CTCTGATATT GCAGCTGGAT ATTGAGGGGA TAAGGTCATT TTTCCGCGCG 960

TTCTTCCGTG TGCCAAAATG GATGTGGCAG GGATTTCTTG GTTCAAGTCT TTCTTAGCAG 1020 

ACCTCATGTT ATTTGCCTTC TACATGTTTA TTATTGCACC AAATGACATG AGAAGAGGCT 1080

TAATCAGACA TCTTTTATCT GATCCTACTG GTGCAACATT GATAAGAACT TATCTTACAT 1140

TTTAGAGTAA ATTCCTCCTA CAATAGTTGT TGAAAGAGGC CTCATTACTT CAGATTCATA 1200

ACAGAAATCG CGGTCTCTCG AGGCCTTGTA TATAACATTT TCACTAGGTT AATATTGCTT 1260 

GAATAAGTTG CACAGTTTCA GTTTTTGTAT CTGCTTCTTT TTTGTCCAAG ATCATGTATT 1320 

GACCAATTTA TATACATTGC CAGTATATAT AAATTTTATA AAAAAAAAAA 1370 

<210> 25 
<211> 377 
<212> PRT 
<213> Potato 

<400> 25

Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile Glu His Val Trp
1 5 10 15

Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Asp Pro Ile Leu Ile Gly
20 25 30

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu
35 40 45

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp
50 55 60

Arg Ile Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly
65 70 75 80

Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala
85 90 95

Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Gly Pro Arg Val Ser Val
100 105 110

Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp
115 120 125

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp
130 135 140

Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro
145 150 155 160

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys
165 170 175

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu
180 185 190

Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Glu Glu Glu Trp Ser
195 200 205

Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu
210 215 220

Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser
225 230 235 240

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala
245 250 255

Asn Ile Leu Arg Gln Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser
260 265 270

Thr Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu
275 280 285

Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu
290 295 300

Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg
305 310 315 320

Val Pro Lys Met Met Trp Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala
325 330 335

Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn Asp
340 345 350

Met Arg Arg Gly Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly Ala
355 360 365

Thr Leu Ile Arg Thr Tyr Leu Thr Phe
370 375

<210> 26 

<211> 533 

<212> PRT 

<213> Chimeric lettuce/potato 

<400> 26 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe
1 5 10 15

Thr Cys Pro Arg Phe Thr Asp Cys Asn Ile Arg His Lys Phe Ser Leu
20 25 30

Leu Lys Gly Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arg
35 40 45

Gln Ile Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gln
50 55 60

Gly Ile Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser
65 70 75 80

Glu Leu Phe Phe Val Gln Met Gln Arg Thr Lys Ser Met Glu Ser Gln
85 90 95

Ser Lys Leu Ser Glu Lys Leu Ala Gln Ile Pro Ile Gly Asn Cys Ile
100 105 110

Leu Asp Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala
115 120 125

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu Ile Gly Pro Asp
130 135 140

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gln Asp Glu Phe Ile Gly
145 150 155 160

Leu Gly Leu Glu Gly Cys Ile Glu His Ser Trp Lys Asp Thr Leu Val
165 170 175

Tyr Leu Asp Asp Ala Asp Pro Ile Arg Ile Gly Arg Ala Tyr Gly Arg
180 185 190

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu
195 200 205

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala
210 215 220

Pro Asn Gly Tyr Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro
225 230 235 240

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu
245 250 255

Glu Tyr Glu Leu Gly Gly Pro Arg Val Ser Val Gln Thr Ala Tyr Gly
260 265 270

Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp Pro Ser Leu Met Val
275 280 285

Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp Ala Gln Ser Leu Glu
290 295 300

Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro Met Ser Pro Thr Arg
305 310 315 320

Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe
325 330 335

Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu Asn Thr Leu Gly Val
340 345 350

Arg Ile Lys Glu Ile Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly
355 360 365

Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu Ala Phe Gly Ala Ala
370 375 380

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu
385 390 395 400

Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala Asn Ile Leu Arg Gln
405 410 415

Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser Thr Pro Ser Ile Ser
420 425 430

Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu Arg Lys Arg Gln Arg
435 440 445

Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu Gln Leu Asp Ile Glu
450 455 460

Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg Val Pro Lys Trp Met
465 470 475 480

Trp Gln Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala Asp Leu Met Leu
485 490 495

Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn Asp Met Arg Arg Gly
500 505 510

Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Leu Ile Arg
515 520 525

Thr Tyr Leu Thr Phe
530

<210> 27
<211> 374
<212> PRT
<213> Arabidopsis thaliana

<400> 27

Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln Lys Cys Ile Glu His Val
1 5 10 15

Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp Asp Lys Pro Ile Thr Ile
20 25 30

Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His Glu Glu Leu
35 40 45

Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser Ser Lys Val
50 55 60

Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu Arg Leu Val Ala Cys Asp
65 70 75 80

Asp Asn Asn Val Ile Pro Cys Arg Leu Ala Thr Val Ala Ser Gly Ala
85 90 95

Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val Gly Gly Pro Arg Val Cys
100 105 110

Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn Ser Pro Tyr
115 120 125

Asp Pro Asp Gln Met Val Phe Met Asp Tyr Arg Asp Tyr Thr Asn Glu
130 135 140

Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu Tyr Ala Met
145 150 155 160

Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys Leu Ala Ser
165 170 175

Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu Met Leu Arg
180 185 190

Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys Thr Tyr Glu Glu Glu Trp
195 200 205

Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Asn
210 215 220

Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr
225 230 235 240

Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala Ser Val Ile
245 250 255

Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys Gln Ile Asn Ser Asn Ile
260 265 270

Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg Lys Arg Gln
275 280 285

Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu Ile Val Gln Phe Asp Thr
290 295 300

Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp
305 310 315 320

Met Trp Gln Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly Asp Leu Val
325 330 335

Leu Phe Ala Leu Tyr Met Phe Val Ile Ser Pro Asn Asn Leu Arg Lys
340 345 350

Gly Leu Ile Asn His Leu Ile Ser Asp Pro Thr Gly Ala Thr Met Ile
355 360 365

Lys Thr Tyr Leu Lys Val
370

<210> 28
<211> 1002
<212> DNA
<213> Adonis palaestina

<400> 28

ATTCATCTTC AGCAGCGCTG TCGTACTCTT TCTATATCTT CTTCCATCAC TAACAGTAGT 60

CGCCGACGGT TGAATCGGCT ATTCGCCTCA ACGTCAACTA TGGGTGAAGT CACTGATGCT 120

GGAATGGATG CTGTTCAGAA GCGGCTCATG TTCGACGACG AATGTATTTT GGTGGATGAG 180

AATGACAAGG TCGTCGGGCA TGATTCCAAA TACAACTGTC ATTTGATGGA AAAGATAGAG 240

GCAGAAAATT TGCTTCACAG AGCCTTCAGT GTTTTCTTGT TCAACTCAAA ATATGAATTG 300

CTTCTTCAGC AACGATCCGC CACAAAGGTA ACATTCCCGC TCGTATGGAC AAACACATGT 360

TGCAGTCATC CTCTCTTTCG TGATTCCGAG CTCATAGAAG AAAATTATCT CGGTGTACGA 420

AACGCTGCAC AAAGAAAGCT TTTAGACGAG CTAGGCATTC CAGCTGAAGA TGTCCCAGTT 480

GATGAATTTA CTCCTCTTGG TCGCATTCTT TACAAAGCTC CATCTGACGG CAAATGGGGA 540

GAGCACGAAT TGGACTATCT CCTATTTATT GTCCGAGATG TGAAATACGA TCCAAACCCA 600

GATGAAGTTG CTGATGCTAA GTATGTTAAT CGCGAGGAGT TGAGAGAGAT ACTGAGAAAA 660

GCTGATGCTG GTGAAGAGGG ACTCAAGTTG TCTCCTTGGT TTAGATTGGT TGTTGATAAC 720

TTTTTGTTCA AGTGGTGGGA TCATGTAGAG CAGGGTACGA TTAAGGAAGT TGCTGACATG 780

AAAACTATCC ACAAGTTGAC TTAAGAGGAC TTCTCTCCTC TGTTCTACTA TTTGTTTTTT 840

GCTACAATAA GTGGGTGGTG ATAAGCAGTT TTTCTGTTTT CTTTAATTTA TGGCTTTTGA 900

ATTTGCCTCG ATGTTGAACT TGTAACATAT TTAGACAAAT ATGAGACCTT GTAAGTTGAA 960

TTTGAGGCTG AATTTATATT TTTGGGAACA


1 r\r\ 1 rvi loll 


an 

nn 




i no? 


<210> 29 
<211> 1271
<212> DNA 

<213> Adonis palaestina 










<400> 29 

rjy prt fry fry tv tv tv f** rri 

1 T J 1 AAACaC I 


CTTTCGCTCC 


ACCACCATCA 


• • 

AAGCCAGCCA 


AATTTCTCTG 


TACAAAAGTT 


SO 


7\7\ TV 7\ 7\ r~ 

AAAAACACCG 


CTTTGGGCTT 


TGGCCCCTCC 


ATATCGGAAT. 


CCTTGTTTAC 


GATACGCATC 


M20 


TAAACCAGTA 


ATTCTCGGTT 


TTAATTTGTT 


TCCTAAATTA 


GGCCCCTTTC 


CGGAATCCCG 


180 


A(j AA I lAIbl 


CGTCGATC&G GATTAATCCT 


TTATATAGTA 


TCTTCTCCAC 


CACCACTAAA 


240 


AL, A 1 1 A I LAb 


CTTCGTGTTC 


TTCTCCCGCT 


GTTCATCTTC 


AGCAGCGTTG 


TACGTACTCT 


300 


1 1 Llrti i It 1 


TCTTCCATCA 


CTAACAGTCC 


TCGCCGAGGG 


TTGAATCGGC 


TGTTCGCCTC 


360 


a a r* <z t rr^ a r* t 


ATGGGTGAAG 


TCGCTGATGC 


TGGTATGGAT 


GCCGTCCAGA 


AGCGGCTTAT 


420 




GAATGTATTT 


TGGTGGATGA 


GAATGACAAG 


GTCGTCGGAC 


ATGATTCCAA 


480 


ATACAACTGT CATTTGATGG AAAAGATAGA GGCAGAAAAC TTGCTTCACA GAGCCTTCAG 540

TGTTTTCTTA TTCAACTCAA AATACGAGTT GCTTCTTCAG CAACGATCTG CAACGAAGGT 600

AACATTCCCG CTCGTATGGA CAAACACCTG TTGCAGCCAT CCCCTCTTCC GTGATTCCGA 660

ACTCATAGAA GAAAATTTTC TCGGGGTACG AAACGCTGCA CAAAGGAAGC TTTTAGACGA 720

GCTAGGCATT CCAGCTGAAG ACGTACCAGT TGATGAATTC ACTCCTCTTG GTCGCATTCT 780

TTACAAAGCT CCATCTGACG GAAAATGGGG AGAGCACGAA CTGGACTATC TTCTGTTTAT 840

TGTCCGAGAT GTGAAATACG ATCCAAACCC AGATGAAGTT GCTGACGCTA AGTACGTTAA 900

TCGCGAGGAG TTGAAAGAGA TACTGAGAAA AGCTGATGCA GGTGAAGAGG GAATAAAGTT 960

GTCTCCTTGG TTTAGATTGG TTGTGGATAA CTTTTTGTTC AAGTGGTGGG ATCATGTAGA 1020

GGAGGGGAAG ATTAAGGACG TCGCCGACAT GAAAACTATC CACAAGTTGA CTTAAGAGAA 1080

AGTCTCTTAA GTTCTACTAT TTGGTTTTTG CTTCAATAAG TGGATGGTGA TGAGCAGTTT 1140

TTATGCTTCC TTTAATTTTG GCTTTTCAAT TTGCTTTATG TGTTGAACTT GTAACATATT 1200

TAGTCAAATA TGAGACCTTG TGAGTTGAAT TTGAGGTTAT ATTTATAGTT TTGGGAACAT 1260

AAAAAAAAAA A 1271

<210> 30 
<211> 1109 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 30 



TGGAACCTGG CCCGGCGGCA GTCCGATGCC GCGATGCTTC GTTCGTTGCT CAGAGGCCTC 60

ACGCATATCC CGCGCGTGAA CTCCGCCCAG CAGCCCAGCT GTGCACACGC GCGACTCCAG 120

TTTAAGCTCA GGAGCATGCA GCTGCTTGCC GAGGACCGCA CAGACCACAT GAGGGGTGCA 180

AGCACCTGGG CAGGCGGGCA GTCGCAGGAT GAGCTGATGC TGAAGGACGA GTGCATCTTA 240

GTGGATGCTG ACGACAACAT CACAGGCCAT GCCAGCAAGC TGGAGTGCCA CAAATTCCTA 300

CCACATCAGC CTGCAGGCCT GCTGCACCGG GCCTTCTCTG TGTTCCTGTT TGACGACCAG 360

GGGCGACTGC TGCTGCAACA GCGTGCACGC TCAAAAATCA CCTTCCCAAG TGTGTGGACG 420

AACACCTGCT GCAGCCACCC TCTACATGGG CAGACCCCAG ATGAGGTGGA CCAACTAAGC 480

CAGGTGGCCG ACGGCACAGT ACCTGGCGCA AAAGCTGCTG CCATCCGCAA GTTGGAGCAC 540

GAGCTGGGGA TACCAGCGCA CCAGCTGCCG GCAAGCGCGT TTCGCTTCCT CACGCGTTTG 600

CACTACTGTG CCGCGGACGT GCAGCCGGCT GCGACACAAT CAGCGCTCTG GGGCGAGCAC 660

GAGATGGACT ACATCTTATT CATCCGGGCC AACGTCACCT TGGCGCCCAA CCCTGACGAG 720

GTGGACGAAG TCAGGTACGT GACGCAAGAG GAGCTGCGGC AGATGATGCA GCCGGACAAC 780

GGGTTGCAAT GGTCGCCGTG GTTTCGCATC ATCGCCGCGC GCTTCCTTGA GCGTTGGTGG 840

GCTGACCTGG ACGCGGCCCT AAACACTGAC AAACACGAGG ATTGGGGAAC GGTGCATCAC 900

ATCAACGAAG CGTGAAGGCA GAAGCTGCAG GATGTGAAGA CACGTCATGG GGTGGAATTG 960

CGTACTTGGC AGCTTCGTAT CTCCTTTTTC TGAGACTGAA CCTGCAGAGC TAGAGTCAAT 1020

GGTGCATCAT ATTCATCGTC TCTCTTTTGT TTTAGACTAA TCTGTAGCTA GAGTCACTGA 1080

TGAATCCTTT ACAACTTTCA AAAAAAAAA 1109


<210> 31 
<211> 985 
<212> DNA 

<213> Lactuca sativa 












<400> 31 
TGCCAAAATG TTGAAATTTC CCCCTTTTAA AACCATTGCT ACCATGATCT CTTCTCCATA 60

TTCTTCCTTC TTGCTGCCTC GGAAATCTTC TTTCCCTCCA ATGCCGTCTC TCGCAGCCGC 120

TAGTGTTTTC CTCCACCCTC TTTCGTCTGC CGCTATGGGC GATTCCAGCA TGGATGCTGT 180

CCAGCGACGT CTCATGTTCG ATGACGAATG CATTTTGGTG GATGAGAATG ACAAAGTGGT 240

TGGCCATGAT ACTAAATACA ATTGTCATTT GATGGAGAAG ATTGAAAAGG GAAATATGCT 300

ACACAGAGCA TTCAGTGTGT TCTTGTTCAA CTCGAAATAT GAATTACTCC TTCAGCAACG 360

TTCTGCAACC AAGGTGACTT TCCCTTTGGT ATGGACAAAC ACGTGTTGCA GCCATCCACT 420

ATACAGGGAG AGTGAGCTTA TTGACGAAAA CGCCCTTGGG GTGAGGAATG CTGCACAGAG 480

GAAGCTCCTG GATGAACTCG GCATCCCTGG AGCAGATGTT CCGGTTGATG AGTTCACTCC 540

ATTGGGTCGC ATTCTATACA AGGCCGCATC GGATGGAAAG TGGGGAGAAC ATGAACTTGA 600

TTACCTGCTG TTTATGGTAC GTGATGTTGG TTTGGATCCG AACCCAGATG AAGTGAAAGA 660

TGTAAAATAT GTGAACCGGG AAGAGCTGAA GGAATTGGTA AGGAAGGCGG ATGCTGGTGA 720

AGAGGGTGTG AAGCTGTCCC CGTGGTTCAA ATTGATTGTC GATAATTTCT TGTTTCAGTG 780

GTGGGATCGA CTCCATAAGG GAACCCTAAC CGAAGCTATT GATATGAAAA CAATCCACAA 840

ACTCACATAA AAACAQTACA CTAGTAGGAG AGAGGATTAT ATGAGATATT TGTTATATGT 900

GAAATTGAAA TTCAGATGAA TGCTTGTATT TATTTCTATT TGGACAAACT TCAACTTCTT 960

TTTGCTACCT TATCAGAAAA AAAAA 985


<210> 32 
<211> 988 

<212> DNA (| , 
<213> Lactuca sativ'a 








• 

• 


• 


<400> 32 
TATTCGCTTC 


AAAATCTCTT 


CCATTAACTG 


CTCAAATCTC 


CACCTTCGCC 


GGTCTTAATC 


60 


TCCGCCGGCG 


CACTTTCACC 


ACCATAACCG 


CCGCCATGdG 


TGACGATTCC 


GGCATGGACG 


120 


CTGTCCAGAG 


ACGTCTCATG 


TTTGATGATG 


AATGCATTTT 


GGTTGATGAA 


AATGACAATG 


180 


TTCTTGGGCA 


TGATACCAAA 


TACAATTGTC 


ACTTGATGGA 


GAAGATTGAG 


AAAGATAATT 


240 


TGCTTCATAG 


AGCATTCAGT 


GTATTTTTAT 


TCAATTCAAA 


ATACGAATTA 


CTCCTTCAGC 


300 


AAAGGTCAGA 


aaccAaggtg 


ACATTTCCTT 


TGGTATGGAC 


AAACACCTGT 


TGCAGCCATC 


360 


CACTATACAG 


AGAATCGGAG 


TTAATTCCCG 


AAAATGCCCT 


TGGGGTCAGA 


AATGCTGCAC 


420 


AGAGGAAGCT 


TCTAGATGAA 


CTCGGTATCC 


CTGCTGAAGA 


TGTTCCAGTT 


GATGAGTTCA 


480 


CAACTTTAGG 


TCGCATGTTG 


TACAAGGCTC 


CATCTGATGG 


AAAATGGGGT 


GAACATGAAG 


540 


TTGATTACCT 


ACTCTTCCTC 


GTGCGTGACG 


TTGCCGTGAA 


CCCAAACCCT 


GATGAGGTGG 


600 


CGGACATTAG 


ATACGTGAAC 


CAAGAAGAGT 


TAAAAGAGTT 


ACTAAGGAAG 


GCGGATGCGG 


660 


GTGAGGAGGG 


TTTGAAATTG 


TCCCCATGGT 


TTAGGCTAGT 


GGTGGACAAC 


TTCTTGTTCA 


720 


AATGGTGGGA 


TCATGTCCAA 


AAGGGGACAC 


TCAATGAAGC 


AATTGACATG 


AAAACCATTC 


780 


ATAAGTTGAT 


ATGAAAAATG 


GTTAATATTT 


ATGGTGGTGG 


TTTGGAGCTA 


ATAATTTGTG 


840 


TGTTCAAGTC 


TCGGTCCTTC 


TTTTTTTAAC 


GTTTTTTTTT 


TTTCTTTTAT 


TGGGAGTGTT 


900 


TATTGTGTAC 


TTGTAACGTA 


GGCCCTTTGG 


TTACGCTTTA 


AGAGTTTAAT


AAAGAACCAC 


960 



CGTTAATTTA


AAAAAAAAAA 


AAAAAAAA 








988 


<210> 33 
<211> 1874 
<212> DNA 

<213> Chlamydomonas reinhardtii 










<400> 33 
GGCACGAGCT 


CGAGTTTGTT 


TTACCATGAC 


ATCGGGAATT 


TGGAAGCTTG 


AACTACCTCA 


60 


ATTACTCAAG 


TAACTCGCGG 


CAACACATTT 


CGCGCGCCAT 


CGCTGTTTTC 


TCTGCTCCAG 


120 


CTACCGAGCA 


GCATTGCTTT 


AGATCGCTTT 


GATGTCATAA 


ACTCCCACTT 


ATATGAGATC 


180 


CAGTTTCATC 


GAGCCCAAGC 


CCAGAGCGCA 


ACCTGTCTTA 


AGCCGCGGCA 


GGGCGTCCAT 


240 


GCGCCTCGCG 


CAAAGCCGTG 


CTCTCGTTGC 


GCGTGTCAGC 


TCCGCCCTGT 


GGCCGGGAGC 


300 


AGGACTTTCA 


CAGGCTCAAA 


GCGTTGCGGT 


GCGAATGGCG 


AGTTCGTCAA 


CCTGGGAAGG 


360 


CACGGGCCTG 


AGCCAGGATG 


ACTTCATGCA 


GCGGGACGAG 


TGCTTGGTGG 


TGGACGAGCA 


420 


GGACCGGCTG 


CTAGGCACCG 


CCAACAAGTA


CGACTGCCAC 


CGCTTCGAGG 


CGGCCAAGGG 


480 


CCAGCCCTGC 


GGCCGCCTGC 


ACCGCGCCTT 


CTCCGTGTTC 


CTGTTCAGCC 


CCGACGGCCG 


540 


ACTGCTGCTG 


CAGCAGCGCG 


CAGCCAGCAA


GGTGACGTTC 


CCGGGTGTGT 


GGACCAACAC 


600 


CTGCTGCTCG 


CACCCGCTGG 


CGGGCCAGGC 


GCCGGACGAG 


GTGGACCTGC 


CGGCGGCGGT 


660 


AGCCTCGGGC 


CAGGTGCCGG 


GCATCAAGGC 


GGCGGCGGTG 


CGCAAGCTGC 


AGCACGAGCT 


720 


GGGGATACCG 


CCGGAGCAGG 


TTCCCGCCTC 


CTCCTTCTCC 


TTCCTCACGC 


GTCTGCACTA 


780 


CTGCGCCGCC 


GACACCGCCA 


CGCACGGCCC 


GGCGGCGGAG 


TGGGGCGAGC 


ACGAGGTGGA 


840 


CTACGTGCTG 


TTCGTGCGGC 


CGCAGCAGCC 


CGTCAGCCTG 


CAGCCCAACC 


CAGACGAGGT 


900 


GGACGCCACG 


CGCTACGTGA 


CGCTGCCGGA 


GCTTCAGTCC 


ATGATGGCGG 


ACCCCGGCCT 


960 


CAGCTGGAGC 


CCCTGGTTCC 


GCATCCTGGC 


CACACAGCCC 


GCCTTCCTGC 


CCGCCTGGTG 


1020


GGGCGACCTG 


AAGCGGCGCT 


GGCGCCCGGG 


CGGCAGCCGA 


CTGTCGGACT 


GGGGCACCAT 


1080 


CCACCGCGTC 


ATGTGAAGAA 


AAAGGGGAAG 


CAGGGGCGGG 


AGCGGGGGAT 


GAATGGGAAT 


1140 


GTGAATGCGA 


TTGTGATGCG 


GCGTGGGATG 


AGGTCTGAAG 


ACAGGGGGAA 


AATCGGGGGG 


1200 








AAAGCCGGGA 


GGCGGACCGC 


GCGATGGGTA 


1260 


CATGTGTGTG 


CGGAGGGTCG 


GTGGGTCGGT 


CGGTTGCGCG 


GCATAGCGTG 


TTGTGTGTGT 


1320 


GCGGCTGCAG 


GGGTATGTGG 


GCACCCGGGC 


ACGGAGGAGA 


AGGCACACGC 


AGGTGGCGCG 


1380 


GAGGTGTGTC 


AGGGGCCATG 


GGCGGGCCTC 


ACTCCTGGTC 


GTGCCCAGTG 


GTCTCGTGGG 


1440 


CAGAGTGGCA 


GGGGCTGCAC 


CCATATGAGC 


GGCGCACTGC 


CGCGCTGGGC 


TAAGTCCTTA 


1500 


TCACTTGGTG 


AGGTGGGGCG 


AGGTGGCTGT 


GGGCGGCGGG 


CGCAGTGGCA 


GAAGGACACG 


1560 


GTGTGTGAGC 


GGTGGAGCTC 


TGGCCGTGCC 


GGCCGTGAGG 


GGCGGATAGC 


GATATGACGT 


1620 



TGTGCTTGGC


CGCTGTAATG 


CGGGAGAATG 


TGCAGGCCGO 


GAGAAGCGGG 


CGGTGGCAGG 


1680 


AGGCCGCAGG 


CTGCAGCACC 


CGTTGGGGAG 


GTGCCACCTG 


CAGGCGCGGC 


GCCGGGCGGG 


1740 


CCTGAGTAAT 


GGGCGCCTGA 


GTAGTGGCGG 


CCACAGGAGG 


CGCAGGAGGC 


AGCAGCAGGA 


1800 


GGACGAGCTG 


GAGGGACCCG 


TTGGCAACCC 


AAGGTTGCGC 


GTGTAACATA 


GTGGCCATAC 


1860 


AAAAAAAAAA 


AAAA 










1874 


<210> 34 
<211> 954 
<212> DNA 

<213> Tagetes erecta 












<400> 34 


rTPAAATfTf 


CTCCGTCGCT 


CTTACTCCGC 


CATGGGTGAC 


GACTCCGGCA 


60 


TGPATf^PTHT 



TPAC^PGAPGT 


CTCATGTTTG 


ACGATGAATG


CATTTTGGTG


GATGAGTGTG 


120




GftGAPATGAT 


ACCAAATACA 


ATTGTCACTT 


GATGGAGAAG 


ATTGAAACAG 


180 


ftTAAAATPPT 


PPAPAGAGPA 


TTCAGCGTTT 


TTCTATTCAA 


TTCAAAATAC


GAGTTACTTC 


240 




ptptppaapp 


AAGGTGACAT 


TTCCTTTAGT 


ATGGACCAAC 


ACCTGTTGCA 


300 




PT APAf; APAA 


rTCCGAGCTTG 


TTCCCGAAAA 


CGCCCTTGGA 


GTAAGAAATG 


360 


CTGCACAGAG 


GAAGCTGTTG 


GATGAACTCG 


GTATCCCTGC 


TGAAGATGTT 


CCCGTTGATC 


420 


AGTTTACTCC 


TTTAGGTCGC 


ATGCTCTACA 


AGGCTCCATC 


TGATGGAAAG 


TGGGGAGAAC 


480 


ATGAACTTGA 


CTACCTACTT 


TTCATAGTGA 


GAGACGTTGC 


TGTAAACCCG 


AACCCAGATG 


540 


AAGTGGCGGA 


TATCAAATAT 


GTGACCAGAA 


GAGTTAAAGG 


AGCTGCTAAG 


GAAAGCAGAT 


600 


GCGGGGGAGG 


AGGGTTTGAA 


GCTGTCTCCA 


TGGTTCAGGT 


TAGTGGTTGA 


TAACTTCTTG 


660 


TTCAAGTGGT 


GGGATCATGT 


GCAAAAGGGT 


ACACTCACTG 


AAGCAATTGA


TATGAAAACC 


720 


ATACACAAGC


TGATATAGAA 


ACACACCCTC 


AACCGAAAAG 


TTCAAGCCTA 


ATAATTCGGG 


780 


TTGGGTCGGG 


TCTACCATCA 


ATTGTTTTTT 


TCTTTTAAGA 


AGTTTTAATC 


TCTATTTGAG 


840 


CATGTTGATT 


CTTGTCTTTT 


GTGTGTAAGA 


TTTTGGGTTT 


CGTTTCAGTT 


GTAATAATGA 


900 


ACCATTGATG 


GTTTGCAATT 


TCAAGTTCCT 


ATCGACATGT 


AGTGATCTAA


AAAA 


954 


<210> 35 

<211> 1031

<212> DNA 

<213> Oryza sativa 












<400> 35 
CCTCCCTTTG 


CCTCGCGCAG 




PPPTTPTPPP 


pppp^app at 


cine c cine c en 


60


GCCGCCGCCG 


TGGAGGACGC 


CGGGATGGAC 


GAGGTCCAGA 


AGCGGCTCAT 


GTTCGACGAC 


120 


GAATGCATTT 


TGGTGGATGA 


ACAAGACAAT 


GTTGTTGGCC 


ATGAATCAAA 


ATATAACTGC 


180 


CATCTGATGG 


AAAAAATCGA 


ATCTGAAAAT 


CTACTTCATA 


GGGCTTTCAG 


TGTATTCCTG 


240 
TTCAACTCAA AATATGAACT CCTACTCCAG CAACGATCTG CAACAAAGGT TACATTTCCT 300

CTAGTTTGGA CCAACACTTG CTGCAGCCAT CCTCTGTACC GTGAGTCTGA GCTTATACAG 360

GAAAACTACC TTGGTGTTAG AAATGCTGCT CAGAGGAAGC TCTTGGATGA GCTGGGCATC 420

CCAGCTGAAG ATGTGCCAGT TGACCAATTC ACCCCTCTTG GTCGGATGCT TTACAAGGCC 480

CCATCTGATG GAAAATGGGG TGAACACGAG CTTGACTACC TGCTGTTCAT CGTCCGCGAC 540

GTGAAGGTAG TCCCGAACCC GGACGAAGTG GCCGATGTGA AATACGTGAG CCGTGAGCAG 600 

CTGAAGGAGC TCATCCGCAA AGCGGACGCC GGAGAGGAAG GCCTGAAGCT GTCTCCCTGG 660 

TTCCGGCTkG TTGTTGACAA CTTCCTCATG GGCTGGTGGG ATCACGTCGA GAAAGGCACC 720 

CTCAACGAGG CCGTGGACAT GGAGACCATC CACAAGCTGA AGTAAGGACT GCGATGTTGT 780 

GGCTGGAAAG AATGATCCTG AAGACTCTGT TCTTGTGCTG CTGCATATTA CTCTTACCAG 840

GGAAGTTGCA GAAGTCAGAA GAAGCTTTTG TATGTTTCTG GGTTTGGAGC TTGGAAGTGT 900 

TGGGCTCTGC TGACTGAGAG ATTCCCTTAT AGAGTGTCTA TGTTAATTTA GCAAACTTCT 960 

ATATTATACA TGATTAGTTA ATTGTTCGGT GTCTGAATAA AGAACAATAG CATGTTCCAT 1020 

GTTTATTTGC T 1031 

<210> 36 
<211> 232 
<212> PRT 

<213> Tagetes erecta 
<400> 36 

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe
1 5 10 15

Asp Asp Glu Cys Ile Leu Val Asp Glu Cys Asp Asn Val Val Gly His
20 25 30

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Thr Gly Lys
35 40 45

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
50 55 60

Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val
65 70 75 80

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
85 90 95

Val Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
100 105 110

Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Gln Phe
115 120 125

Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp
130 135 140

Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Ala
145 150 155 160

Val Asn Pro Asn Pro Asp Glu Val Ala Asp Ile Lys Tyr Val Ser His
165 170 175

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly
180 185 190

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe
195 200 205

Lys Trp Trp Asp His Val Gln Lys Gly Thr Leu Thr Glu Ala Ile Asp
210 215 220

Met Lys Thr Ile His Lys Leu Ile
225 230

<210> 37

<211> 280
<212> PRT

<213> Lactuca sativa

<400> 37

Met Leu Lys Phe Pro Pro Phe Lys Thr Ile Ala Thr Met Ile Ser Ser
1 5 10 15

Pro Tyr Ser Ser Phe Leu Leu Pro Arg Lys Ser Ser Phe Pro Pro Met
20 25 30

Pro Ser Leu Ala Ala Ala Ser Val Phe Leu His Pro Leu Ser Ser Ala
35 40 45

Ala Met Gly Asp Ser Ser Met Asp Ala Val Gln Arg Arg Leu Met Phe
50 55 60

Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val Gly His
65 70 75 80

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Lys Gly Asn
85 90 95

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
100 105 110

Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val
115 120 125

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
130 135 140

Ile Asp Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
145 150 155 160

Leu Asp Glu Leu Gly Ile Pro Gly Ala Asp Val Pro Val Asp Glu Phe
165 170 175

Thr Pro Leu Gly Arg Ile Leu Tyr Lys Ala Ala Ser Asp Gly Lys Trp
180 185 190

Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Met Val Arg Asp Val Gly
195 200 205

Leu Asp Pro Asn Pro Asp Glu Val Lys Asp Val Lys Tyr Val Asn Arg
210 215 220

Glu Glu Leu Lys Glu Leu Val Arg Lys Ala Asp Ala Gly Glu Glu Gly
225 230 235 240

Val Lys Leu Ser Pro Trp Phe Lys Leu Ile Val Asp Asn Phe Leu Phe
245 250 255

Gln Trp Trp Asp Arg Leu His Lys Gly Thr Leu Thr Glu Ala Ile Asp
260 265 270

Met Lys Thr Ile His Lys Leu Thr
275 280

<210> 38 
<211> 229 
<212> PRT 

<213> Lactuca sativa
<400> 38 

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe
1 5 10 15

Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Asn Val Leu Gly His
20 25 30

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Lys Asp Asn
35 40 45

Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
50 55 60

Leu Leu Leu Gln Gln Arg Ser Glu Thr Lys Val Thr Phe Pro Leu Val
65 70 75 80

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
85 90 95

Ile Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
100 105 110

Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Glu Phe
115 120 125

Thr Thr Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp
130 135 140

Gly Glu His Glu Val Asp Tyr Leu Leu Phe Leu Val Arg Asp Val Ala
145 150 155 160

Val Asn Pro Asn Pro Asp Glu Val Ala Asp Ile Arg Tyr Val Asn Gln
165 170 175

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly
180 185 190

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe
195 200 205

Lys Trp Trp Asp His Val Gln Lys Gly Thr Leu Asn Glu Ala Ile Asp
210 215 220

Met Lys Thr Ile His
225

<210> 39 
<211> 295 
<212> PRT 

<213> Adonis palaestina
<400> 39 

Met Ser Ser Ile Arg Ile Asn Pro Leu Tyr Ser Ile Phe Ser Thr Thr
1 5 10 15

Thr Lys Thr Leu Ser Ala Ser Cys Ser Ser Pro Ala Val His Leu Gln
20 25 30

Gln Arg Cys Arg Thr Leu Ser Ile Ser Ser Ser Ile Thr Asn Ser Pro
35 40 45

Arg Arg Gly Leu Asn Arg Leu Phe Ala Ser Thr Ser Thr Met Gly Glu
50 55 60

Val Ala Asp Ala Gly Met Asp Ala Val Gln Lys Arg Leu Met Phe Asp
65 70 75 80

Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val Gly Tyr Asp
85 90 95

Ser Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala Glu Asn Leu
100 105 110

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu
115 120 125

Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val Trp
130 135 140

Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser Glu Leu Ile
145 150 155 160

Glu Glu Asn Phe Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu Leu
165 170 175

Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Glu Phe Thr
180 185 190

Pro Leu Gly Arg Ile Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly
195 200 205

Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Lys Tyr
210 215 220

Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val Asn Arg Glu
225 230 235 240

Glu Leu Lys Glu Ile Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly Ile
245 250 255

Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe Lys
260 265 270

Trp Trp Asp His Val Glu Glu Gly Lys Ile Lys Asp Val Ala Asp Met
275 280 285

Lys Thr Ile His Lys Leu Thr
290 295

<210> 40 
<211> 234 
<212> PRT 

<213> Adonis palaestina
<400> 40 

Met Gly Glu Val Thr Asp Ala Gly Met Asp Ala Val Gln Lys Arg Leu
1 5 10 15

Met Phe Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val
20 25 30

Gly His Asp Ser Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala
35 40 45

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys
50 55 60

Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro
65 70 75 80

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser
85 90 95

Glu Leu Ile Glu Glu Asn Tyr Leu Gly Val Arg Asn Ala Ala Gln Arg
100 105 110

Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp
115 120 125

Glu Phe Thr Pro Leu Gly Arg Ile Leu Tyr Lys Ala Pro Ser Asp Gly
130 135 140

Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp
145 150 155 160

Val Lys Tyr Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val
165 170 175

Asn Arg Glu Glu Leu Arg Glu Ile Leu Arg Lys Ala Asp Ala Gly Glu
180 185 190

Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe
195 200 205

Leu Phe Lys Trp Trp Asp His Val Glu Gln Gly Thr Ile Lys Glu Val
210 215 220

Ala Asp Met Lys Thr Ile His Lys Leu Thr
225 230

<210> 41 
<211> 238 
<212> PRT

<213> Oryza sativa
<400> 41 

Met Ala Gly Ala Ala Ala Ala Val Glu Asp Ala Gly Met Asp Glu Val
1 5 10 15

Gln Lys Arg Leu Met Phe Asp Asp Glu Cys Ile Leu Val Asp Glu Gln
20 25 30

Asp Asn Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu Met Glu
35 40 45

Lys Ile Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu
50 55 60

Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys
65 70 75 80

Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu
85 90 95

Tyr Arg Glu Ser Glu Leu Ile Gln Glu Asn Tyr Leu Gly Val Arg Asn
100 105 110

Ala Ala Gln Arg Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp
115 120 125

Val Pro Val Asp Gln Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala
130 135 140

Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe
145 150 155 160

Ile Val Arg Asp Val Lys Val Val Pro Asn Pro Asp Glu Val Ala Asp
165 170 175

Val Lys Tyr Val Ser Arg Glu Gln Leu Lys Glu Leu Ile Arg Lys Ala
180 185 190

Asp Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val
195 200 205

Val Asp Asn Phe Leu Met Gly Trp Trp Asp His Val Glu Lys Gly Thr
210 215 220

Leu Asn Glu Ala Val Asp Met Glu Thr Ile His Lys Leu Lys
225 230 235

<210> 42 
<211> 233 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 

Met Thr Asp Ser Asn Asp Ala Gly Met Asp Ala Val Gln Arg Arg Leu
1 5 10 15

Met Phe Glu Asp Glu Cys Ile Leu Val Asp Glu Asn Asn Arg Val Val
20 25 30

Gly His Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala
35 40 45

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys
50 55 60

Tyr Glu Leu Leu Leu Gln Gln Arg Ser Lys Thr Lys Val Thr Phe Pro
65 70 75 80

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser
85 90 95

Glu Leu Ile Glu Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gln Arg
100 105 110

Lys Leu Phe Asp Glu Leu Gly Ile Val Ala Glu Asp Val Pro Val Asp
115 120 125

Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly
130 135 140

Lys Trp Gly Glu His Glu Val Asp Tyr Leu Leu Phe Ile Val Arg Asp
145 150 155 160

Val Lys Leu Gln Pro Asn Pro Asp Glu Val Ala Glu Ile Lys Tyr Val
165 170 175

Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp
180 185 190

Glu Ala Val Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe
195 200 205

Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr Ile Thr Glu Ala
210 215 220

Ala Asp Met Lys Thr Ile His Lys Leu
225 230

<210> 43 
<211> 293
<212> PRT 

<213> Haematococcus pluvialis 
<400> 43 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Leu Leu Ser Glu Asp Arg Thr Asp His Met Arg Gly
35 40 45

Ala Ser Thr Trp Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys
50 55 60

Asp Glu Cys Ile Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala
65 70 75 80

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu
85 90 95

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu
100 105 110

Leu Leu Gln Gln Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp
115 120 125

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu
130 135 140

Val Asp Gln Leu Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys
145 150 155 160

Ala Ala Ala Ile Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His
165 170 175

Gln Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys
180 185 190

Ala Ala Asp Val Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu
195 200 205

His Glu Met Asp Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala
210 215 220

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu
225 230 235 240

Leu Arg Gln Met Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp
245 250 255

Phe Arg Ile Ile Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu
260 265 270

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His
275 280 285

His Ile Asn Glu Ala
290

<210> 44 
<211> 304 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 44 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Met Thr Leu Met Gln Pro Ser Ile Ser Ala Asn Leu
35 40 45

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp
50 55 60

Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys Asp Glu Cys Ile
65 70 75 80

Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala Ser Lys Leu Glu
85 90 95

Cys His Lys Phe Leu Pro His Pro Ala Gly Leu Leu His Arg Ala Phe
100 105 110

Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu Leu Leu Gln Gln Arg
115 120 125

Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp Thr Asn Thr Cys Cys
130 135 140

Ser His Pro Leu His Gly Gln Thr Pro Asp Glu Val Asp Gln Leu Ser
145 150 155 160

Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala Ile Arg
165 170 175

Lys Leu Glu His Glu Leu Gly Ile Pro Ala His Gln Leu Pro Ala Ser
180 185 190

Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val Gln
195 200 205

Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu His Glu Met Asp Tyr
210 215 220

Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp Glu
225 230 235 240

Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu Leu Arg Gln Met Met
245 250 255

Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp Phe Arg Ile Ile Ala
260 265 270

Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu Asn
275 280 285

Thr Asp Lys His Glu Asp Trp Gly Thr Val His His Ile Asn Glu Ala
290 295 300

<210> 45 
<211> 307 
<212> PRT 

<213> Chlamydomonas reinhardtii 
<400> 45 

Met Arg Ser Ser Phe Ile Glu Pro Lys Pro Arg Ala Gln Pro Val Leu
1 5 10 15

Ser Arg Gly Arg Ala Ser Met Arg Leu Ala Gln Ser Arg Ala Leu Val
20 25 30

Ala Arg Val Ser Ser Ala Leu Trp Pro Gly Ala Gly Leu Ser Gln Ala
35 40 45

Gln Ser Val Ala Val Arg Met Ala Ser Ser Ser Thr Trp Glu Gly Thr
50 55 60

Gly Leu Ser Gln Asp Asp Phe Met Gln Arg Asp Glu Cys Leu Val Val
65 70 75 80

Asp Glu Gln Asp Arg Leu Leu Gly Thr Ala Asn Lys Tyr Asp Cys His
85 90 95

Arg Phe Glu Ala Ala Lys Gly Gln Pro Cys Gly Arg Leu His Arg Ala
100 105 110

Phe Ser Val Phe Leu Phe Ser Pro Asp Gly Arg Leu Leu Leu Gln Gln
115 120 125

Arg Ala Ala Ser Lys Val Thr Phe Pro Gly Val Trp Thr Asn Thr Cys
130 135 140

Cys Ser His Pro Leu Ala Gly Gln Ala Pro Asp Glu Val Asp Leu Pro
145 150 155 160

Ala Ala Val Ala Ser Gly Gln Val Pro Gly Ile Lys Ala Ala Ala Val
165 170 175

Arg Lys Leu Gln His Glu Leu Gly Ile Pro Pro Glu Gln Val Pro Ala
180 185 190

Ser Ser Phe Ser Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Thr
195 200 205

Ala Thr His Gly Pro Ala Ala Glu Trp Gly Glu His Glu Val Asp Tyr
210 215 220

Val Leu Phe Val Arg Pro Gln Gln Pro Val Ser Leu Gln Pro Asn Pro
225 230 235 240

Asp Glu Val Asp Ala Thr Arg Tyr Val Thr Leu Pro Glu Leu Gln Ser
245 250 255

Met Met Ala Asp Pro Gly Leu Ser Trp Ser Pro Trp Phe Arg Ile Leu
260 265 270

Ala Thr Gln Pro Ala Phe Leu Pro Ala Trp Trp Gly Asp Leu Lys Arg
275 280 285

Arg Trp Arg Pro Gly Gly Ser Arg Leu Ser Asp Trp Gly Thr Ile His
290 295 300

Arg Val Met
305

<210> 46 
<211> 1848 
<212> DNA 

<213> Adonis palaestina 
<400> 46 

GAGAGAAAAA GAGTGTTATA TTAATGTTAC TGTCGCATTC TTGCAACACA TATTCAGACT 60
CCATTTTCTT GTTTTCTCTT CAAAACAACA AACTAATGTG ACGGAGTATC TAGCTATGGA 120

ACTACTTGGT GTTCGCAACC TCATCTCTTC TTGCCCTGTC TGGACTTTTG GAACAAGAAA 180

CCTTAGTAGT TCAAAACTAG CTTATAACAT ACATCGATAT GGTTCTTCTT GTAGAGTAGA 240



TTTTCAAGTG


AGGGCTGATG 


GTGGAAGCGG 


GAGTAGAACT 


TCTGTTGCTT 


ATAAAGAGGG 


300 



TTTTGTGGAC 


GAGGAGGATT 


TTATCAAAGC 


TGGTGGTTCT 


GAGCTTTTGT 


TTGTCCAAAT 


360 


GCAGCAAACA 


AAGTCTATGG 


AGAAACAGGC 


CAAGCTCGCC 


GATAAGTTGC 


CACCAATACC 


420 


TTTCGGAGAA 


TCTGTGATGG 


ACTTGGTTGT 


AATAGGTTGT 


GGACCTGCTG 


GTCTTTCACT 


480 


GGCTGCAGAA 


GCTGCTAAGC 


TAGGCTTGAA 


AGTTGGCCTT 


ATTGGTCCTG 


ATCTTCCTTT 


540 


TACAAATAAT 


TATGGTGTGT 


GGGAAGACGA 


GTTCAAAGAT 


CTTGGACTTG 


AACGTTGTAT 


600 


CGAGCATGCT 


TGGAAGGACA 


CCATCGTATA 


TCTTGACAAT 


GATGCTCCTG 


TCCTTATTGG 


660 


TCGTGCATAT


GGACGAGTTA 


GCCGGCATTT 


GCTGCATGAA 


GAGTTGCTGA 


AAAGCTGTGT 


720 


CGAGTCAGGT 


GTATCATATC 


TGAATTCTAA 


AGTGGAAAGG 


ATCACTGAAG 


CTGGTGATGG 


780 


CCATAGTCTT 


GTAGTTTGTG 


AAAACGACAT


CTTTATCCCT 


TGCAGGCTTG 


CTACTGTTGC 


840 


ATCTGGAGCA 


GCTTCAGGGA 


AACTTTTGGA 


GTATGAAGTA 


GGTGGCCCTC 


GTGTTTGTGT 


900 


CCAAACTGCT 


TATGGTGTGG 


AGGTTGAGGT 


GGAGAACAAT 


CCATACGATC 


CCAACTTAAT 


960 


GGTATTTATG 


GACTACAGAG 


ACTATATGCA 


ACAGAAATTA 


CAGTGCTCGG 


AAGAAGAATA 


1020 


TCCAACATTT 


CTCTATGTCA 


TGCCCATGTC 


GCCAACAAGA 


CTTTTTTTTG 


AGGAAACCTG 


1080 


TTTGGCCTCA 


AAAGATGCCA 


TGCCTTTCGA 


TCTACTGAAG 


AGAAAACTAA


TGTCACGATT 


1140 


GAAGACTCTG 


GGTATCCAAG 


TTACAAAAAT


TTATGAAGAG 


GAATGGTCTT 


ATATTCCTGT 


1200 


TGGGGGTTCT 


TTACCAAACA 


CAGAGCAAAA 


GAACCTAGCA 


TTTGGTGCTG 


CAGCAAGCAT 


1260 


GGTGCATCCA 


GCAACAGGCT 


ATTCGGTTGT 


ACGATCACTA 


TCAGAAGCTC 


CAAAATATGC 


1320 


TTCTGTAATT 


GCAAAGATTT 


TGAAGCAAGA 


TAACTCTGCA 


TATGTGGTTT 


CTGGACAAAG 


1380 


CAGTGCAGTA 


AACATTTCAA 


TGCAAGCATG 


GAGCAGTCTT 


TGGCCAAAGG 


AGCGAAAACG


1440 


TCAAAGAGCA 


TTCTTTCTTT 


TCGGGTTAGA 


GCTTATTGTG 


CAGCTAGATA 


TTGAAGCAAC 


1500 


CAGAACGTTC 


TTTAGAACCT


TCTTCCGCTT 


GCCAACTTGG 


ATGTGGTGGG 


GTTTCCTTGG 


1560 


GTCTTCACTA 


TCATCTTTCG 


ATCTTGTATT 


GTTTTCCATG 


TACATGTTTG 


TTTTGGCCCC 


1620 


GAACAGCATG


AGGATGTCAC 


TTGTGAGACA 


TTTGCTTTCA 


GATCCTTCTG 


GTGCAGTTAT 


1680 


GGTTAAAGCT 


TACCTCGAAA 


GGTAATCTGT 


TTTATGAAAC 


TATAGTGTCT 


CATTAAATAA


1740 


ATGAGGATCC


TTCGTATATG 


TATATGATCA 


TCTCTATGTA 


TATCCTATAT 


TCTAATCTCA 


1800 


TAAAGTAATC 


GAAAATTCAT 


TGATAGAAAA 


AAAAAAAAAA 


AAAAAAAA 




1848 



<210> 47 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 47 

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu
515 520 525

Arg

<210> 48 
<211> 378 
<212> PRT 
<213> Potato 

<400> 48 

Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile Glu His Val Trp
1 5 10 15

Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Asp Pro Ile Leu Ile Gly
20 25 30

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu
35 40 45

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp
50 55 60

Arg Ile Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly
65 70 75 80

Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala
85 90 95

Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Gly Pro Arg Val Ser Val
100 105 110

Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp
115 120 125

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp
130 135 140

Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro
145 150 155 160

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys
165 170 175

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu
180 185 190

Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Glu Glu Glu Trp Ser
195 200 205

Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu
210 215 220

Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser
225 230 235 240

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala
245 250 255

Asn Ile Leu Arg Gln Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser
260 265 270

Thr Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu
275 280 285

Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu
290 295 300

Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg
305 310 315 320

Val Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Ser Leu Ser Xaa
325 330 335

Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn
340 345 350

Asp Met Arg Arg Gly Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly
355 360 365

Ala Thr Leu Ile Arg Thr Tyr Leu Thr Phe
370 375

<210> 49 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 49 
Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr
1 5 10 15

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr
20 25 30

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly
35 40 45

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe
50 55 60

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe
65 70 75 80

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val
85 90 95

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val
100 105 110

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala
115 120 125

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr
130 135 140

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln
145 150 155 160

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp
165 170 175

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg
180 185 190

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser
195 200 205

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu
210 215 220

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala
225 230 235 240

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val
245 250 255

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu
260 265 270

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr
275 280 285

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro
290 295 300

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu
305 310 315 320

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 
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* 325 .33.0'' . • 335 

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly lie Arg lie Leu Lys 

34 0 34 5 350 ^ . 

Thr Tyr Glu Glu Glu Trp Ser Tyr lie Pro Val Gly Gly Ser Leu Pro 
355 360 365 

Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 
370 375 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 400 

Lys Tyr Ala Ser Val He Ala Glu He Leu Arg Glu Glu Thr Thr Lys 
. 4 05 410 415 

Gin He Asn Ser, Asn He Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro 
420 425 430 

Pro Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu 
435 440 1 445 

He Val Gin Phe Asp Thr Glu Gly He Arg Ser' Phe Phe Arg Thr Phe 
450 455 460 

Phe Arg Leu Pro Lys ,Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu 
465 * "470 '475 . 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val lie Ser 
485 490 495 

Pro Asn Asn Leu Arg Lys Gly Leu He Asn His Leu He Ser Asp Pro 
500 505 510 

Thr Gly Ala Thr Met He Lys Thr Tyr Leu Lys Val 
515 520

<210> 50
<211> 529
<212> PRT
<213> Adonis palaestina
<400> 50

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu
515 520 525

Arg

<210> 51
<211> 529
<212> PRT
<213> Adonis palaestina
<400> 51

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Thr Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asn Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Asp Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Ile
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Lys Ala Tyr Leu Glu
515 520 525

Arg 



<210> 52 
<211> 533 
<212> PRT 
<213> Lettuce 

<400> 52 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe
1 5 10 15

Thr Cys Pro Arg Phe Thr Asp Cys Asn Ile Arg His Lys Phe Ser Leu
20 25 30

Leu Lys Gln Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arg
35 40 45

Gln Ile Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gln
50 55 60

Gly Ile Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser
65 70 75 80

Glu Leu Phe Phe Val Gln Met Gln Arg Thr Lys Ser Met Glu Ser Gln
85 90 95

Ser Lys Leu Ser Glu Lys Leu Ala Gln Ile Pro Ile Gly Asn Cys Ile
100 105 110

Leu Asp Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala
115 120 125

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu Ile Gly Pro Asp
130 135 140

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gln Asp Glu Phe Ile Gly
145 150 155 160

Leu Gly Leu Glu Gly Cys Ile Glu His Ser Trp Lys Asp Thr Leu Val
165 170 175

Tyr Leu Asp Asp Ala Asp Pro Ile Arg Ile Gly Arg Ala Tyr Gly Arg
180 185 190

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu
195 200 205

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala
210 215 220

Pro Asn Gly Tyr Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro
225 230 235 240

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu
245 250 255

Glu Tyr Glu Leu Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly
260 265 270

Ile Glu Val Glu Val Glu Asn Asn Pro Tyr Asp Pro Asp Leu Met Val
275 280 285

Phe Met Asp Tyr Arg Asp Phe Ser Lys His Lys Pro Glu Ser Leu Glu
290 295 300

Ala Lys Tyr Pro Thr Phe Leu Tyr Val Met Ala Met Ser Pro Thr Lys
305 310 315 320

Ile Phe Phe Glu Glu Thr Cys Leu Ala Ser Arg Glu Ala Met Pro Phe
325 330 335

Asn Leu Leu Lys Ser Lys Leu Met Ser Arg Leu Lys Ala Met Gly Ile
340 345 350

Arg Ile Thr Arg Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly
355 360 365

Gly Ser Leu Pro Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala
370 375 380

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu
385 390 395 400

Ser Glu Ala Pro Asn Tyr Ala Ala Val Ile Ala Lys Ile Leu Arg Gln
405 410 415

Asp Gln Ser Lys Glu Met Ile Ser Leu Gly Lys Tyr Thr Asn Ile Ser
420 425 430

Lys Gln Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gln Arg
435 440 445

Ala Phe Phe Leu Phe Gly Leu Ser His Ile Val Leu Met Asp Leu Glu
450 455 460

Gly Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp Met
465 470 475 480

Trp Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu Ile Ile
485 490 495

Phe Ala Leu Tyr Met Phe Val Ile Ala Pro His Ser Leu Arg Met Glu
500 505 510

Leu Val Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Met Val Lys
515 520 525

Ala Tyr Leu Thr Ile
530

<210> 53
<211> 526
<212> PRT 
<213> Tomato 

<400> 53 

Met Glu Cys Val Gly Val Gln Asn Val Gly Ala Met Ala Val Leu Thr
1 5 10 15

Arg Pro Arg Leu Asn Arg Trp Ser Gly Gly Glu Leu Cys Gln Glu Lys
20 25 30

Ser Ile Phe Leu Ala Tyr Glu Gln Tyr Glu Ser Lys Cys Asn Ser Ser
35 40 45

Ser Gly Ser Asp Ser Cys Val Val Asp Lys Glu Asp Phe Ala Asp Glu
50 55 60

Glu Asp Tyr Ile Lys Ala Gly Gly Ser Gln Leu Val Phe Val Gln Met
65 70 75 80

Gln Gln Lys Lys Asp Met Asp Gln Gln Ser Lys Leu Ser Asp Glu Leu
85 90 95

Arg Gln Ile Ser Ala Gly Gln Thr Val Leu Asp Leu Val Val Ile Gly
100 105 110

Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly
115 120 125

Leu Asn Val Gly Leu Val Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr
130 135 140

Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile
145 150 155 160

Glu His Val Trp Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Glu Pro
165 170 175

Ile Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Phe Leu His
180 185 190

Glu Glu Leu Leu Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn
195 200 205

Ser Lys Val Asp Arg Ile Val Glu Ala Thr Asn Gly Gln Ser Leu Val
210 215 220

Glu Cys Glu Gly Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala
225 230 235 240

Ser Gly Ala Ala Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Ser Pro
245 250 255

Arg Val Ser Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn
260 265 270

Asn Pro Phe Asp Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr
275 280 285

Leu Arg His Asp Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu
290 295 300






Tyr Ala Met Pro Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys
305 310 315 320

Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu
325 330 335

Met Leu Arg Leu Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Glu
340 345 350

Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu
355 360 365

Gln Lys Thr Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala
370 375 380

Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala
385 390 395 400

Ser Val Leu Ala Asn Ile Leu Arg Gln His Tyr Ser Lys Asn Met Leu
405 410 415

Thr Ser Ser Ser Ile Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu
420 425 430

Trp Pro Gln Glu Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu
435 440 445

Ala Leu Ile Leu Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg
450 455 460

Ala Phe Phe Arg Val Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser
465 470 475 480

Ser Leu Ser Ser Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile
485 490 495

Ile Ala Pro Asn Asp Met Arg Lys Gly Leu Ile Arg His Leu Leu Ser
500 505 510

Asp Pro Thr Gly Ala Thr Leu Ile Arg Thr Tyr Leu Thr Phe
515 520 525

<210> 54 
<211> 516 
<212> PRT 

<213> Tagetes erecta 
<400> 54 

Met Ser Met Arg Ala Gly His Met Thr Ala Thr Met Ala Ala Phe Thr
1 5 10 15

Cys Pro Arg Phe Met Thr Ser Ile Arg Tyr Thr Lys Gln Ile Lys Cys
20 25 30

Asn Ala Ala Lys Ser Gln Leu Val Val Lys Gln Glu Ile Glu Glu Glu
35 40 45

Glu Asp Tyr Val Lys Ala Gly Gly Ser Glu Leu Leu Phe Val Gln Met
50 55 60

Gln Gln Asn Lys Ser Met Asp Ala Gln Ser Ser Leu Ser Gln Lys Leu
65 70 75 80

Pro Arg Val Pro Ile Gly Gly Gly Gly Asp Ser Asn Cys Ile Leu Asp
85 90 95

Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Gly Glu
100 105 110

Ser Ala Lys Leu Gly Leu Asn Val Ala Leu Ile Gly Pro Asp Leu Pro
115 120 125

Phe Thr Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Ile Gly Leu Gly
130 135 140

Leu Glu Gly Cys Ile Glu His Val Trp Arg Asp Thr Val Val Tyr Leu
145 150 155 160

Asp Asp Asn Asp Pro Ile Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser
165 170 175

Arg Asp Leu Leu His Glu Glu Leu Leu Thr Arg Cys Met Glu Ser Gly
180 185 190

Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala Pro Asn
195 200 205

Gly Leu Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro Cys Arg
210 215 220

Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr
225 230 235 240

Glu Leu Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Ile Glu
245 250 255

Val Glu Val Glu Ser Ile Pro Tyr Asp Pro Ser Leu Met Val Phe Met
260 265 270

Asp Tyr Arg Asp Tyr Thr Lys His Lys Ser Gln Ser Leu Glu Ala Gln
275 280 285

Tyr Pro Thr Phe Leu Tyr Val Met Pro Met Ser Pro Thr Lys Val Phe
290 295 300

Phe Glu Glu Thr Cys Leu Ala Ser Lys Glu Ala Met Pro Phe Glu Leu
305 310 315 320

Leu Lys Thr Lys Leu Met Ser Arg Leu Lys Thr Met Gly Ile Arg Ile
325 330 335

Thr Lys Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser
340 345 350

Leu Pro Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser
355 360 365

Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu
370 375 380

Ala Pro Asn Tyr Ala Ala Val Ile Ala Lys Ile Leu Gly Lys Gly Asn
385 390 395 400

Ser Lys Gln Met Leu Asp His Gly Arg Tyr Thr Thr Asn Ile Ser Lys
405 410 415

Gln Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gln Arg Ala
420 425 430

Phe Phe Leu Phe Gly Leu Ala Leu Ile Val Gln Met Asp Ile Glu Gly
435 440 445

Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp
450 455 460

Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu Ile Ile Phe
465 470 475 480

Ala Phe Tyr Met Phe Ile Ile Ala Pro His Ser Leu Arg Met Gly Leu
485 490 495

Val Arg His Leu Leu Ser Asp Pro Thr Gly Gly Thr Met Leu Lys Ala
500 505 510

Tyr Leu Thr Ile
515

<210> 55 
<211> 501 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 55 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Asp Phe Phe Ile Pro
1 5 10 15

Gln Phe His Gly Phe Glu Arg Leu Cys Ser Asn Asn Pro Tyr His Ser
20 25 30

Arg Val Arg Leu Gly Val Lys Lys Arg Ala Ile Lys Ile Val Ser Ser
35 40 45

Val Val Ser Gly Ser Ala Ala Leu Leu Asp Leu Val Pro Glu Thr Lys
50 55 60

Lys Glu Asn Leu Asp Phe Glu Leu Pro Leu Tyr Asp Thr Ser Lys Ser
65 70 75 80

Gln Val Val Asp Leu Ala Ile Val Gly Gly Gly Pro Ala Gly Leu Ala
85 90 95

Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp
100 105 110

Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp
115 120 125

Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr Thr Trp Ser
130 135 140

Gly Ala Val Val Tyr Val Asp Glu Gly Val Lys Lys Asp Leu Ser Arg
145 150 155 160

Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Leu Gln
165 170 175

Lys Cys Ile Thr Asn Gly Val Lys Phe His Gln Ser Lys Val Thr Asn
180 185 190

Val Val His Glu Glu Ala Asn Ser Thr Val Val Cys Ser Asp Gly Val
195 200 205

Lys Ile Gln Ala Ser Val Val Leu Asp Ala Thr Gly Phe Ser Arg Cys
210 215 220

Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala Tyr
225 230 235 240

Gly Ile Val Ala Glu Val Asp Gly His Pro Phe Asp Val Asp Lys Met
245 250 255

Val Phe Met Asp Trp Arg Asp Lys His Leu Asp Ser Tyr Pro Glu Leu
260 265 270

Lys Glu Arg Asn Ser Lys Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe
275 280 285

Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro
290 295 300

Gly Leu Arg Met Glu Asp Ile Gln Glu Arg Met Ala Ala Arg Leu Lys
305 310 315 320

His Leu Gly Ile Asn Val Lys Arg Ile Glu Glu Asp Glu Arg Cys Val
325 330 335

Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly
340 345 350

Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val
355 360 365

Ala Arg Thr Leu Ala Ala Ala Pro Ile Val Ala Asn Ala Ile Val Arg
370 375 380

Tyr Leu Gly Ser Pro Ser Ser Asn Ser Leu Arg Gly Asp Gln Leu Ser
385 390 395 400

Ala Glu Val Trp Arg Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg
405 410 415

Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Asp
420 425 430

Ala Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Gln Pro His Tyr
435 440 445

Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu Val
450 455 460

Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu
465 470 475 480

Ile Met Thr Lys Gly Thr Val Pro Leu Ala Lys Met Ile Asn Asn Leu
485 490 495

Val Gln Asp Arg Asp
500

<210> 56 
<211> 502 
<212> PRT 

<213> Adonis palaestina 
<400> 56 

Met Asp Thr Leu Leu Arg Thr His Asn Lys Leu Glu Leu Leu Pro Thr
1 5 10 15

Leu His Gly Phe Ala Glu Lys Gln His Leu Val Ser Thr Ser Lys Leu
20 25 30

Gln Asn Gln Val Phe Arg Ile Ala Ser Arg Asn Ile His Pro Cys Arg
35 40 45

Asn Gly Thr Val Lys Ala Arg Gly Ser Ala Leu Leu Glu Leu Val Pro
50 55 60

Glu Thr Lys Lys Glu Asn Leu Glu Phe Asp Leu Pro Ala Tyr Asp Pro
65 70 75 80

Ser Arg Gly Ile Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala
85 90 95

Gly Leu Ala Ile Ala Gln Gln Val Ser Glu Ala Gly Leu Leu Val Cys
100 105 110

Ser Ile Asp Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val
115 120 125

Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr
130 135 140

Thr Trp Ser Gly Ala Val Val Tyr Thr Asp Asp Asn Ser Lys Lys Tyr
145 150 155 160

Leu Asp Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys
165 170 175

Met Leu Gln Lys Cys Val Thr Asn Gly Val Lys Phe His Gln Ala Lys
180 185 190

Val Ile Lys Val Ile His Glu Glu Ser Lys Ser Leu Leu Ile Cys Asn
195 200 205

Asp Gly Ile Thr Ile Asn Ala Thr Val Val Leu Asp Ala Thr Gly Phe
210 215 220

Ser Arg Cys Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln
225 230 235 240

Val Ala Tyr Gly Ile Met Ala Glu Val Glu Glu His Pro Phe Asp Leu
245 250 255

Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser His Leu Asn Glu Lys
260 265 270

Leu Glu Leu Lys Asp Lys Asn Arg Lys Ile Pro Thr Phe Leu Tyr Ala
275 280 285

Met Pro Phe Ser Ser Thr Lys Ile Phe Leu Glu Glu Thr Ser Leu Val
290 295 300

Ala Arg Pro Gly Leu Arg Phe Glu Asp Ile Gln Glu Arg Met Val Ala
305 310 315 320

Arg Leu Lys His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu
325 330 335

Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg
340 345 350

Val Val Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly
355 360 365

Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Lys Ser
370 375 380

Ile Val Gln Tyr Leu Gly Ser Asp Arg Ser Leu Ser Gly Asn Glu Leu
385 390 395 400

Ser Ala Glu Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln
405 410 415

Arg Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu
420 425 430

Gln Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro His
435 440 445

Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu
450 455 460

Phe Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Ala Ser Arg Ile
465 470 475 480

Glu Ile Met Ala Lys Gly Thr Val Pro Leu Val Asn Met Met Asn Asn
485 490 495

Leu Ile Gln Asp Thr Asp
500

<210> 57 
<211> 498 
<212> PRT 
<213> Pepper 

<400> 57 

Met Asp Thr Leu Leu Arg Thr Pro Asn Asn Leu Glu Phe Leu His Gly
1 5 10 15

Phe Gly Val Lys Val Ser Ala Phe Ser Ser Val Lys Ser Gln Lys Phe
20 25 30

Gly Ala Lys Lys Phe Cys Glu Gly Leu Gly Ser Arg Ser Val Cys Val
35 40 45

Lys Ala Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys
50 55 60

Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val
65 70 75 80

Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val
85 90 95

Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp Pro
100 105 110

Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu
115 120 125

Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly
130 135 140

Ala Ala Val Tyr Ile Asp Asp Lys Thr Thr Lys Asp Leu Asn Arg Pro
145 150 155 160

Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met Gln Lys
165 170 175

Cys Ile Leu Asn Gly Val Lys Phe His Gln Ala Lys Val Ile Lys Val
180 185 190

Ile His Glu Glu Ser Lys Ser Met Leu Ile Cys Asn Asp Gly Ile Thr
195 200 205

Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg Ser Leu
210 215 220

Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala Tyr Gly
225 230 235 240

Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys Met Val
245 250 255

Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Val Glu Leu Lys
260 265 270

Glu Arg Asn Ser Arg Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser
275 280 285

Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly
290 295 300

Leu Gly Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu Ser His
305 310 315 320

Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys Val Ile
325 330 335

Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly Ile
340 345 350

Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala
355 360 365

Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile Gln Tyr
370 375 380

Leu Ser Ser Glu Arg Ser His Ser Gly Asp Glu Leu Ser Ala Ala Val
385 390 395 400

Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe
405 410 415

Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala Thr Arg
420 425 430

Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly
435 440 445

Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Ile Val Phe Gly Leu
450 455 460

Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu Ile Met Thr
465 470 475 480

Lys Gly Thr Leu Pro Leu Val His Met Ile Asn Asn Leu Leu Gln Asp
485 490 495

Lys Glu



<210> 58 

<211> 500 

<212> PRT 

<213> Tomato 

<400> 58 

Met Asp Thr Leu Leu Lys Thr Pro Asn Asn Leu Glu Phe Leu Asn Pro
1 5 10 15

His His Gly Phe Ala Val Lys Ala Ser Thr Phe Arg Ser Glu Lys His
20 25 30

His Asn Phe Gly Ser Arg Lys Phe Cys Glu Thr Leu Gly Arg Ser Val
35 40 45

Cys Val Lys Gly Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr
50 55 60

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys
65 70 75 80

Gly Val Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu
85 90 95

Ala Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile
100 105 110

Asp Pro Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val
115 120 125

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp
130 135 140

Ser Gly Ala Ala Val Tyr Ile Asp Asp Asn Thr Ala Lys Asp Leu His
145 150 155 160

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met
165 170 175

Gln Lys Cys Ile Met Asn Gly Val Lys Phe His Gln Ala Lys Val Ile
180 185 190

Lys Val Ile His Glu Glu Ser Lys Ser Met Leu Ile Cys Asn Asp Gly
195 200 205

Ile Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg
210 215 220

Ser Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala
225 230 235 240

Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys
245 250 255

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Thr Asp
260 265 270

Leu Lys Glu Arg Asn Ser Arg Ile Pro Thr Phe Leu Tyr Ala Met Pro
275 280 285

Phe Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg
290 295 300

Pro Gly Leu Arg Ile Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu
305 310 315 320

Asn His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys
325 330 335

Leu Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val
340 345 350

Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met
355 360 365

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile
370 375 380

Gln Tyr Leu Gly Ser Glu Arg Ser His Ser Gly Asn Glu Leu Ser Thr
385 390 395 400

Ala Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu
405 410 415

Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala
420 425 430

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp
435 440 445

His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Ile Val Phe
450 455 460

Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Phe Glu Ile
465 470 475 480

Met Thr Lys Gly Thr Val Pro Leu Val Asn Met Ile Asn Asn Leu Leu
485 490 495

Gln Asp Lys Glu
500

<210> 59
<211> 500
<212> PRT
<213> Tobacco

<400> 59

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Glu Phe Leu His Pro
1 5 10 15

Val His Gly Phe Ser Val Lys Ala Ser Ser Phe Asn Ser Val Lys Pro
20 25 30

His Lys Phe Gly Ser Arg Lys Ile Cys Glu Asn Trp Gly Lys Gly Val
35 40 45

Cys Val Lys Ala Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr
50 55 60

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys
65 70 75 80

Gly Leu Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu
85 90 95

Ala Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Val Ser Ile
100 105 110

Asp Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val
115 120 125

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp
130 135 140

Ser Gly Thr Val Val Tyr Ile Asp Asp Asn Thr Thr Lys Asp Leu Asp
145 150 155 160

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met
165 170 175

Gln Lys Cys Ile Leu Asn Gly Val Lys Phe His His Ala Lys Val Ile
180 185 190

Lys Val Ile His Glu Glu Ala Lys Ser Met Leu Ile Cys Asn Asp Gly
195 200 205

Val Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg
210 215 220

Cys Leu Val Gln Tyr Asp Lys Pro Tyr Lys Pro Gly Tyr Gln Val Ala
225 230 235 240

Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Thr Ser Lys
245 250 255

Met Val Leu Met Asp Trp Arg Asp Ser His Leu Gly Asn Asn Met Glu
260 265 270

Leu Lys Glu Arg Asn Arg Lys Val Pro Thr Phe Leu Tyr Ala Met Pro
275 280 285

Phe Ser Ser Asn Lys Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg
290 295 300

Pro Gly Leu Arg Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu
305 310 315 320

Asn His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys
325 330 335

Val Ile Pro Met Gly Gly Ser Leu Pro Val Ile Pro Gln Arg Val Val
340 345 350

Gly Thr Gly Gly Thr Ala Gly Leu Val His Pro Ser Thr Gly Tyr Met
355 360 365

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile
370 375 380

His Tyr Leu Gly Ser Glu Lys Asp Leu Leu Gly Asn Glu Leu Ser Ala
385 390 395 400

Ala Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu
405 410 415

Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala
420 425 430

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp
435 440 445

His Gly Phe Leu Ser Ser Arg Leu Tyr Leu Pro Glu Leu Ile Phe Phe
450 455 460

Gly Leu Ser Leu Phe Ser Arg Ala Ser Asn Thr Ser Arg Ile Glu Ile
465 470 475 480

Met Thr Lys Gly Thr Leu Pro Leu Val Asn Met Ile Asn Asn Leu Leu
485 490 495

Gln Asp Thr Glu
500



<210> 60 
<211> 511 
<212> PRT 

<213> Tagetes erecta 
<400> 60 

Met Asp Thr Phe Leu Arg Thr Tyr Asn Ser Phe Glu Phe Val His Pro
1 5 10 15

Ser Asn Lys Phe Ala Gly Asn Leu Asn Asn Leu Asn Gln Leu Asn Gln
20 25 30

Ser Lys Ser Gln Phe Gln Asp Phe Arg Phe Gly Pro Lys Lys Ser Gln
35 40 45

Phe Lys Leu Gly Gln Lys Tyr Cys Val Lys Ala Ser Ser Ser Ala Leu
50 55 60

Leu Glu Leu Val Pro Glu Ile Lys Lys Glu Asn Leu Asp Phe Asp Leu
65 70 75 80

Pro Met Tyr Asp Pro Ser Arg Asn Val Val Val Asp Leu Val Val Val
85 90 95

Gly Gly Gly Pro Ser Gly Leu Ala Val Ala Gln Gln Val Ser Glu Ala
100 105 110

Gly Leu Thr Val Cys Ser Ile Asp Pro Ser Pro Lys Leu Ile Trp Pro
115 120 125

Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu
130 135 140

Asp Cys Leu Asp Thr Thr Trp Ser Ser Ala Val Val Tyr Ile Asp Glu
145 150 155 160

Lys Ser Thr Lys Ser Leu Asn Arg Pro Tyr Ala Arg Val Asn Arg Lys
165 170 175

Gln Leu Lys Thr Lys Met Leu Gln Lys Cys Ile Ala Asn Gly Val Lys
180 185 190

Phe His Gln Ala Lys Val Ile Lys Val Ile His Glu Glu Leu Lys Ser
195 200 205

Leu Leu Ile Cys Asn Asp Gly Val Thr Ile Gln Ala Thr Leu Val Leu
210 215 220

Asp Ala Thr Gly Phe Ser Arg Ser Leu Val Gln Tyr Asp Lys Pro Tyr
225 230 235 240

Asn Pro Gly Tyr Gln Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu
245 250 255

His Pro Phe Asp Val Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser
260 265 270

His Leu Asp Gln Asn Leu Glu Ile Lys Ala Arg Asn Ser Arg Ile Pro
275 280 285

Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser Thr Arg Ile Phe Leu Glu
290 295 300

Glu Thr Ser Leu Val Ala Arg Pro Gly Leu Lys Met Glu Asp Ile Gln
305 310 315 320

Glu Arg Met Ala Tyr Arg Leu Lys His Leu Gly Ile Lys Val Lys Ser
325 330 335

Ile Glu Glu Asp Glu Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro
340 345 350

Val Leu Pro Gln Arg Val Leu Gly Ile Gly Gly Thr Ala Gly Met Val
355 360 365

His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro
370 375 380

Ile Val Ala Lys Ser Ile Ile Arg Tyr Leu Asn Asn Glu Lys Ser Met
385 390 395 400

Val Ala Asp Val Thr Gly Asp Asp Leu Ala Ala Gly Ile Trp Arg Glu
405 410 415

Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe Cys Phe Gly
420 425 430

Met Asp Ile Leu Leu Lys Leu Asp Leu Glu Gly Thr Arg Arg Phe Phe
435 440 445

Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser
450 455 460

Ser Arg Leu Phe Leu Pro Glu Leu Val Thr Phe Gly Leu Ser Leu Phe
465 470 475 480

Gly His Ala Ser Asn Thr Cys Arg Val Glu Ile Met Ala Lys Gly Thr
485 490 495

Leu Pro Leu Ala Thr Met Ile Gly Asn Leu Val Arg Asp Arg Glu
500 505 510



<210> 61 
<211> 503 
<212> PRT 
<213> Daffodil 

<400> 61 

Met Asp Thr Leu Leu Arg Thr His Asn Arg Leu Glu Leu Leu Tyr Pro
1 5 10 15

Leu His Glu Leu Ala Lys Arg His Phe Leu Ser Pro Ser Pro Asn Pro
20 25 30

Gln Asn Pro Asn Phe Lys Phe Phe Ser Arg Lys Pro Tyr Gln Lys Lys
35 40 45

Cys Arg Asn Gly Tyr Ile Gly Val Ser Ser Asn Gln Leu Leu Asp Leu
50 55 60

Val Pro Glu Ile Lys Lys Glu His Leu Glu Phe Asp Leu Pro Leu Tyr
65 70 75 80

Asp Pro Ser Lys Ala Leu Thr Leu Asp Leu Ala Val Val Gly Gly Gly
85 90 95

Pro Leu Ala Arg Ser Cys Ser Thr Ser Leu Gly Gly Gly Leu Ser Val
100 105 110

Val Ser Ile Asp Pro Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly
115 120 125

Val Trp Val Asp Glu Phe Glu Asp Met Asp Leu Leu Asp Cys Leu Asp
130 135 140

Ala Thr Trp Ser Gly Ala Ile Val Tyr Val Asp Asp Arg Ser Thr Lys
145 150 155 160

Asn Leu Ser Arg Pro Tyr Ala Arg Val Asn Arg Lys Asn Leu Lys Ser
165 170 175

Lys Met Met Lys Lys Cys Val Ser Asn Gly Val Arg Phe His Gln Ala
180 185 190

Thr Val Val Lys Ala Met His Glu Glu Glu Lys Ser Tyr Leu Ile Cys
195 200 205

Ser Asp Gly Val Thr Ile Asp Ala Arg Val Val Leu Asp Ala Thr Gly
210 215 220

Phe Ser Arg Cys Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr
225 230 235 240

Gln Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp
245 250 255

Val Asp Lys Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Gly
260 265 270

Lys Ala Glu Leu Asn Glu Arg Asn Ala Lys Ile Pro Thr Phe Leu Tyr
275 280 285

Ala Met Pro Phe Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu
290 295 300

Val Ala Arg Pro Gly Leu Lys Met Glu Asp Ile Gln Glu Arg Met Val
305 310 315 320

Ala Arg Leu Asn His Leu Gly Ile Arg Ile Lys Ser Ile Glu Glu Asp
325 330 335

Glu Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro Val Ile Pro Gln
340 345 350

Arg Val Val Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr
355 360 365

Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro Ile Val Ala Asn
370 375 380

Ser Ile Val Gln Tyr Leu Val Ser Asp Ser Gly Leu Ser Gly Asn Asp
385 390 395 400

Leu Ser Ala Asp Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg
405 410 415

Gln Arg Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp
420 425 430

Leu Glu Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro
435 440 445

Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu
450 455 460

Val Pro Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Cys Lys
465 470 475 480

Leu Glu Ile Met Ala Lys Gly Thr Leu Pro Leu Val Asn Met Ile Asn
485 490 495

Asn Leu Val Gln Asp Arg Asp
500

GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM
AND METHODS OF USE THEREOF

BACKGROUND OF THE INVENTION
Field of the Invention
The present invention describes nucleic acid sequences for eukaryotic genes encoding
lycopene ε-cyclase (also known as ε-cyclase and ε lycopene cyclase), isopentenyl
pyrophosphate isomerase (IPP) and β-carotene hydroxylase as well as vectors containing the
same and hosts transformed with said vectors. The present invention also provides methods
for augmenting the accumulation of carotenoids, changing the composition of the
carotenoids, and producing novel and rare carotenoids. The present invention provides
methods for controlling the ratio or relative amounts of various carotenoids in a host. The
invention also relates to modified lycopene ε-cyclase, IPP isomerase and β-carotene
hydroxylase. Additionally, the present invention provides a method for screening for genes
and cDNAs encoding enzymes of carotenoid biosynthesis and metabolism.

Background of the Invention
Carotenoid pigments with cyclic endgroups are essential components of the
photosynthetic apparatus in oxygenic photosynthetic organisms (e.g., cyanobacteria, algae
and plants; Goodwin, 1980). The symmetrical bicyclic yellow carotenoid pigment
β-carotene (or, in rare cases, the asymmetrical bicyclic α-carotene) is intimately associated with
the photosynthetic reaction centers and plays a vital role in protecting against potentially
lethal photooxidative damage (Koyama, 1991). β-carotene and other carotenoids derived
from it or from α-carotene also serve as light-harvesting pigments (Siefermann-Harms,
1987), are involved in the thermal dissipation of excess light energy captured by the
light-harvesting antenna (Demmig-Adams & Adams, 1992), provide substrate for the biosynthesis
of the plant growth regulator abscisic acid (Rock & Zeevaart, 1991; Parry & Horgan, 1991),
and are precursors of vitamin A in human and animal diets (Krinsky, 1987). Plants also
exploit carotenoids as coloring agents in flowers and fruits to attract pollinators and agents of
seed dispersal (Goodwin, 1980). The color provided by carotenoids is also of agronomic
value in a number of important crops. Carotenoids are currently harvested from a variety of
organisms, including plants, algae, yeasts, cyanobacteria and bacteria, for use as pigments in
food and feed.

The probable pathway for formation of cyclic carotenoids in plants, algae and
cyanobacteria is illustrated in Figure 1. Two types of cyclic endgroups or rings are
commonly found in higher plant carotenoids; these are referred to as the β (beta) and ε
(epsilon) rings (Fig. 3). The precursor acyclic endgroup (no ring structure) is referred to as
the ψ (psi) endgroup. The β and ε endgroups differ only in the position of the double bond in
the ring. Carotenoids with two β rings are ubiquitous, and those with one β and one ε ring
are common, but carotenoids with two ε rings are uncommon. β-carotene (Fig. 1) has two
β-endgroups and is a symmetrical compound that is the precursor of a number of other
important plant carotenoids such as zeaxanthin and violaxanthin (Fig. 2).
Genes encoding enzymes of carotenoid biosynthesis have previously been isolated
from a variety of sources including bacteria (Armstrong et al., 1989, Mol. Gen. Genet. 216,
254-268; Misawa et al., 1990, J. Bacteriol., 172, 6704-12), fungi (Schmidhauser et al., 1990,
Mol. Cell. Biol. 10, 5064-70), cyanobacteria (Chamovitz et al., 1990, Z. Naturforsch, 45c,
482-86; Cunningham et al., 1994) and higher plants (Bartley et al., Proc. Natl. Acad. Sci USA
88, 6532-36; Martinez-Ferez & Vioque, 1992, Plant Mol. Biol. 18, 981-83). Many of the
isolated enzymes show a great diversity in structure, function and inhibitory properties
between sources. For example, phytoene desaturases from the cyanobacterium
Synechococcus and from higher plants and green algae carry out a two-step desaturation to
yield ζ-carotene as a reaction product. In plants and cyanobacteria a second enzyme
(ζ-carotene desaturase), similar in amino acid sequence to the phytoene desaturase, catalyzes
two additional desaturations to yield lycopene. In contrast, a single desaturase enzyme from
Erwinia herbicola and from other bacteria introduces all four double bonds required to form
lycopene. The Erwinia and other bacterial desaturases bear little amino acid sequence
similarity to the plant and cyanobacterial desaturase enzymes, and are thought to be of
unrelated ancestry. Therefore, even with a gene in hand from one source, it may be difficult to
identify a gene encoding an enzyme of similar function in another organism. In particular,
the sequence similarity between certain of the prokaryotic and eukaryotic genes encoding
enzymes of carotenoid biosynthesis is quite low.

Further, the mechanism of gene expression in prokaryotes and eukaryotes appears to
differ sufficiently such that one cannot expect that an isolated eukaryotic gene will be
properly expressed in a prokaryotic host.

The difficulty in isolating genes encoding enzymes with similar functions is
exemplified by recent efforts to isolate the gene encoding the enzyme that catalyzes the
formation of β-carotene from the acyclic precursor lycopene. Although a gene encoding an
enzyme with this function had been isolated from a bacterium, it had not been isolated from
any photosynthetic prokaryote or from any eukaryotic organism. The isolation and
characterization of the enzyme catalyzing formation of β-carotene in the cyanobacterium
Synechococcus PCC7942 was described by the present inventors and others (Cunningham et
al., 1993 and 1994). The amino acid sequence similarity of the cyanobacterial enzyme to the
various bacterial lycopene β-cyclases is so low (ca. 18-25% overall; Cunningham et al.,
1994) that there is much uncertainty as to whether they share a common ancestry or, instead,
represent an example of convergent evolution.

The need remains for the isolation of eukaryotic and prokaryotic genes and cDNAs
encoding polypeptides involved in the carotenoid biosynthetic pathway, including those
encoding a lycopene ε-cyclase, IPP isomerase and β-carotene hydroxylase. There remains a
need for methods to enhance the production of carotenoids, to alter the composition of
carotenoids, and to reduce or eliminate carotenoid production. There also remains a need in
the art for methods for screening for genes and cDNAs encoding enzymes of carotenoid
biosynthesis and metabolism.

SUMMARY OF THE INVENTION

Accordingly, a first object of this invention is to provide purified and/or isolated
nucleic acids which encode enzymes involved in carotenoid biosynthesis; in particular,
lycopene ε-cyclase, IPP isomerase and β-carotene hydroxylase.

A second object of this invention is to provide purified and/or isolated nucleic acids 
which encode enzymes which produce novel or uncommon carotenoids. 

A third object of the present invention is to provide vectors containing said genes. 

A fourth object of the present invention is to provide hosts transformed with said 
vectors. 

Another object of the present invention is to provide hosts which accumulate novel or 
uncommon carotenoids or which accumulate greater amounts of specific or total carotenoids. 

Another object of the present invention is to provide hosts with inhibited and/or 
altered carotenoid production.

Another object of this invention is to secure the expression of eukaryotic
carotenoid-related genes in a recombinant prokaryotic host.

Yet another object of the present invention is to provide a method for screening for
eukaryotic and prokaryotic genes and cDNAs which encode enzymes involved in carotenoid
biosynthesis and metabolism.

An additional object of the invention is to provide a method for manipulating
carotenoid biosynthesis in photosynthetic organisms by inhibiting the synthesis of certain
enzymatic products to cause accumulation of precursor compounds.

Another object of the invention is to provide modified lycopene ε-cyclase, IPP
isomerase and β-carotene hydroxylase.

These and other objects of the present invention have been realized by the present
inventors as described below.

A subject of the present invention is an isolated and/or purified nucleic acid sequence
which encodes for a protein having lycopene ε-cyclase, IPP isomerase or β-carotene
hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4,
14-21 or 23-27.

The invention also includes vectors which comprise any of the nucleic acid sequences 
listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of producing or enhancing the
production of a carotenoid in a host cell, comprising inserting into the host cell a vector
comprising a heterologous nucleic acid sequence which encodes for a protein having
lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase enzyme activity, wherein the
heterologous nucleic acid sequence is operably linked to a promoter; and expressing the
heterologous nucleic acid sequence to produce the protein.

Yet another subject of the present invention is a method of modifying the production
of carotenoids in a host cell, the method comprising inserting into the host cell a vector
comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for
a protein which modifies lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid
sequence is operably linked to a promoter; and expressing the heterologous nucleic acid
sequence in the host cell to modify the production of the carotenoids in the host cell, relative
to the untransformed host cell.

The present invention also includes a method of expressing, in a host cell, a
heterologous nucleic acid sequence which encodes for a protein having lycopene ε-cyclase,
IPP isomerase or β-carotene hydroxylase enzyme activity, the method comprising inserting
into the host cell a vector comprising the heterologous nucleic acid sequence, wherein the
heterologous nucleic acid sequence is operably linked to a promoter; and expressing the
heterologous nucleic acid sequence.

Also included is a method of expressing, in a host cell, a heterologous nucleic acid
sequence which encodes for a protein which modifies lycopene ε-cyclase, IPP isomerase or
β-carotene hydroxylase enzyme activity in the host cell, relative to an untransformed host
cell, the method comprising inserting into the host cell a vector comprising the heterologous
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably linked to a
promoter; and expressing the heterologous nucleic acid sequence.

Another subject of the present invention is a method for screening for genes and 
cDNAs which encode enzymes involved in carotenoid biosynthesis and metabolism. 

BRIEF DESCRIPTION OF THE DRAWINGS

A more complete appreciation of the invention and many of the attendant advantages 
thereof will be readily obtained as the same becomes better understood by reference to the 
following detailed description when considered in connection with the accompanying 
drawings, wherein: 

Figure 1 is a schematic representation of the putative pathway of β-carotene
biosynthesis in cyanobacteria, algae and plants. The enzymes catalyzing various steps are
indicated at the left. Target sites of the bleaching herbicides NFZ and MPTA are also
indicated at the left. Abbreviations: DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl
pyrophosphate; GGPP, geranylgeranyl pyrophosphate; GPP, geranyl pyrophosphate; IPP,
isopentenyl pyrophosphate; LCY, lycopene cyclase; MVA, mevalonic acid; MPTA,
2-(4-methylphenoxy)triethylamine hydrochloride; NFZ, norflurazon; PDS, phytoene desaturase;
PSY, phytoene synthase; ZDS, ζ-carotene desaturase; PPPP, prephytoene pyrophosphate.

Figure 2 depicts possible routes of synthesis of cyclic carotenoids and common plant
and algal xanthophylls (oxycarotenoids) from neurosporene. Demonstrated activities of the
β- and ε-cyclase enzymes of A. thaliana are indicated by bold arrows labelled with β or ε
respectively. A bar below the arrow leading to ε-carotene indicates that the enzymatic
activity was examined but no product was detected. The steps marked by an arrow with a
dotted line have not been specifically examined. Conventional numbering of the carbon
atoms is given for neurosporene and α-carotene. Inverted triangles (▼) mark positions of the
double bonds introduced as a consequence of the desaturation reactions.

Figure 3 depicts the carotene endgroups which are found in plants.

Figure 4 is a DNA sequence and the predicted amino acid sequence of a lycopene
ε-cyclase cDNA isolated from A. thaliana (SEQ ID NOS: 1 and 2). These sequences were
deposited under Genbank accession number U50738. This cDNA is incorporated into the
plasmid pATeps.

Figure 5 is a DNA sequence encoding the β-carotene hydroxylase isolated from A.
thaliana (SEQ ID NO: 3). This cDNA is incorporated into the plasmid pATOHB.

Figure 6 is an alignment of the predicted amino acid sequences of A. thaliana
β-carotene hydroxylase (SEQ ID NO: 4) with those of the bacterial β-carotene hydroxylase
enzymes from Alcaligenes sp. (SEQ ID NO: 5) (Genbank D58422), Erwinia herbicola Eho10
(SEQ ID NO.: 6) (GenBank M872280), Erwinia uredovora (SEQ ID NO.: 7) (GenBank
D90087) and Agrobacterium aurantiacum (SEQ ID NO.: 8) (GenBank D58420). A
consensus sequence is also shown. All five genes are identical where a capital letter appears
in the consensus. A lowercase letter indicates that three of five, including A. thaliana, have
the identical residue. TM; transmembrane.
Figure 7 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from A.
thaliana (SEQ ID NO: 9). This cDNA is incorporated into the plasmid pATDP5.

Figure 8 is a DNA sequence of a second cDNA encoding another IPP isomerase
isolated from A. thaliana (SEQ ID NO: 10). This cDNA is incorporated into the plasmid
pATDP7.

Figure 9 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from
Haematococcus pluvialis (SEQ ID NO: 11). This cDNA is incorporated into the plasmid
pHP04.

Figure 10 is a DNA sequence of a second cDNA encoding another IPP isomerase
isolated from Haematococcus pluvialis (SEQ ID NO: 12). This cDNA is incorporated into
the plasmid pHP05.

Figure 11 is an alignment of the amino acid sequences predicted by IPP isomerase
cDNAs isolated from A. thaliana (SEQ ID NO.: 16 and 18), H. pluvialis (SEQ ID NOS.: 14
and 15), Clarkia breweri (SEQ ID NO.: 17) (See, Blanc & Pichersky, Plant Physiol. (1995)
108:855; Genbank accession no. X82627) and Saccharomyces cerevisiae (SEQ ID NO.: 19)
(Genbank accession no. J05090).

Figure 12 is a DNA sequence of the cDNA encoding an IPP isomerase isolated from
Tagetes erecta (marigold; SEQ ID NO: 13). This cDNA is incorporated into the plasmid
pPMDP1. xxx's denote a region not originally sequenced. Figure 21A shows the complete
marigold sequence.

Figure 13 is an alignment of the consensus sequence of four plant β-cyclases (SEQ ID
NO.: 20) with the A. thaliana lycopene ε-cyclase (SEQ ID NO.: 21). A capital letter in the
plant β consensus is used where all four β-cyclase genes predict the same amino acid residue
in this position. A small letter indicates that an identical residue was found in three of the
four. Dashes indicate that the amino acid residue was not conserved and dots in the sequence
denote a gap. A consensus for the aligned sequences is given in capital letters below the
alignment, where the β- and ε-cyclases have the same amino acid residue. Arrows indicate
some of the conserved amino acids that will be used as junction sites for construction of
chimeric cyclases with novel enzymatic activities. Several regions of interest including a
sequence signature indicative of a dinucleotide-binding motif and two predicted
transmembrane (TM) helical regions are indicated below the alignment and are underlined.
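
By way of illustration only, the consensus convention described for Figure 13 (capital letter where all sequences agree, small letter where three of four agree, dash where the residue is not conserved, dot for a gap) can be sketched in Python as follows. The function name `consensus`, the `min_partial` parameter, the treatment of any gapped column as a dot, and the toy alignment are assumptions made for this sketch; they are not taken from the figure or from any software used by the inventors.

```python
from collections import Counter


def consensus(aligned, min_partial=3):
    """Build a consensus line from equal-length, upper-case aligned sequences.

    Upper case: every sequence has the same residue in the column.
    Lower case: at least `min_partial` sequences share a residue.
    '-': residue not conserved.  '.': the column contains a gap
    (an assumption of this sketch).
    """
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must have equal length")
    out = []
    for column in zip(*aligned):
        if "." in column:
            out.append(".")
            continue
        residue, count = Counter(column).most_common(1)[0]
        if count == len(column):
            out.append(residue.upper())
        elif count >= min_partial:
            out.append(residue.lower())
        else:
            out.append("-")
    return "".join(out)


# Toy alignment with hypothetical residues (not taken from any figure).
toy = ["MDTLK", "MDSLK", "MDTIK", "METL."]
print(consensus(toy))
```

With five aligned sequences, as in Figure 6, the same function could be called with `min_partial=3` to mark columns in which three of five residues agree; the additional condition there that A. thaliana be among the agreeing sequences is not modelled in this sketch.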

Figure 14 shows the nucleotide (SEQ ID NO:22) and amino acid sequences (SEQ ID
NO:23) of the Adonis palaestina (pheasant's eye) ε-cyclase cDNA #5.

Figure 15A shows the nucleotide (SEQ ID NO:24) and amino acid sequences (SEQ
ID NO:25) of a potato ε-cyclase cDNA. Figure 15B shows the amino acid sequence (SEQ ID
NO:26) of a chimeric lettuce/potato lycopene ε-cyclase. Amino acids in lower case are from
the lettuce cDNA and those in upper case are from the potato cDNA. The product of this
chimeric cDNA has ε-cyclase activity and converts lycopene to the monocyclic δ-carotene.

Figure 16 shows a comparison between the amino acid sequences of the Arabidopsis
ε-cyclase (SEQ ID NO:27) and the potato ε-cyclase (SEQ ID NO:25).

Figure 17A shows the nucleotide sequence of the Adonis palaestina Ipi1 (SEQ ID
NO:28) and Figure 17B shows the nucleotide sequence of the Adonis palaestina Ipi2 (SEQ
ID NO:29).

Figure 18A shows the nucleotide sequence of the Haematococcus pluvialis Ipi1 (SEQ
ID NO:11) and Figure 18B shows the nucleotide sequence of the Haematococcus pluvialis Ipi2
(SEQ ID NO:30).

Figure 19A shows the nucleotide sequence of the Lactuca sativa (romaine lettuce)
Ipi1 (SEQ ID NO:31) and Figure 19B shows the nucleotide sequence of the Lactuca sativa
Ipi2 (SEQ ID NO:32).

Figure 20 shows the nucleotide sequence of the Chlamydomonas reinhardtii Ipi1
(SEQ ID NO:33).

Figure 21A shows the nucleotide sequence of the Tagetes erecta (marigold) Ipi1 (SEQ
ID NO:34) and Figure 21B shows the nucleotide sequence of the Oryza sativa (rice) Ipi1
(SEQ ID NO:35).

Figure 22 shows an amino acid sequence alignment of various plant and green algal
isopentenyl isomerases (IPI) (SEQ ID NOS: 16, 36-45).

Figure 23 shows a comparison between Adonis palaestina ε-cyclase cDNA #3 and
cDNA #5 nucleotide sequences.

Figure 24 shows a comparison between Adonis palaestina ε-cyclase cDNA #3 and
cDNA #5 predicted amino acid sequences.

Figure 25 shows a sequence alignment of various plant β- and ε-cyclases. Those
sequences outlined in grey denote identical sequences among the ε-cyclases. Those
sequences outlined in black denote identical sequences among both the β- and ε-cyclases.

Figure 26 shows a sequence alignment of the plant ε-cyclases from Figure 25. Those
sequences outlined in black denote identical sequences among the ε-cyclases.

Figure 27 is a dendrogram or "tree" illustrating the degree of amino acid sequence
similarity for various lycopene β- and ε-cyclases.
Figure 28 shows a comparison between Arabidopsis ε-cyclase and lettuce ε-cyclase
predicted amino acid sequences.

DESCRIPTION OF THE PREFERRED EMBODIMENTS
The present invention includes an isolated and/or purified nucleic acid sequence
which encodes for a protein having lycopene ε-cyclase, IPP isomerase or β-carotene
hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4,
14-21, 23 or 25-27. Nucleic acids encoding lycopene ε-cyclase, β-carotene hydroxylase and IPP
isomerases have been isolated from several genetically distant sources.

The present inventors have isolated nucleic acids encoding the enzyme IPP isomerase,
which catalyzes the reversible conversion of isopentenyl pyrophosphate (IPP) to
dimethylallyl pyrophosphate (DMAPP). IPP isomerase cDNAs were isolated from the plants
A. thaliana, Tagetes erecta (marigold), Adonis palaestina (pheasant's eye), Lactuca sativa
(romaine lettuce) and from the green algae H. pluvialis and Chlamydomonas reinhardtii.
Alignments of the amino acid sequences predicted by some of these cDNAs are shown in
Figures 12 and 22. Plasmids containing some of these cDNAs were deposited with the
American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on March 4,
1996 under ATCC accession numbers 98000 (pHP05 - H. pluvialis); 98001 (pMDP1 -
marigold); 98002 (pATDP7 - A. thaliana) and 98004 (pHP04 - H. pluvialis).

The present inventors have also isolated nucleic acids encoding the enzyme
β-carotene hydroxylase, which is responsible for hydroxylating the β-endgroup in carotenoids.
The nucleic acid of the present invention is shown in SEQ ID NO: 3 and Figure 5. The full
length cDNA product hydroxylates both end groups of β-carotene as do products of cDNAs
which encode proteins truncated by up to 50 amino acids from the N-terminus. Products of
genes which encode proteins truncated between about 60-110 amino acids from the
N-terminus preferentially hydroxylate only one ring. A plasmid containing this gene was
deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD
20852 on March 4, 1996 under ATCC accession number 98003 (pATOHB - A. thaliana).

The present inventors have also isolated nucleic acids encoding the enzyme lycopene
ε-cyclase, which is responsible for the formation of ε-endgroups in carotenoids. The A.
thaliana ε-cyclase adds an ε ring to only one end of the symmetrical lycopene while the
related β-cyclase adds a ring at both ends. The A. thaliana cDNA of the present invention is
shown in Figure 4 and SEQ ID NO: 1. A plasmid containing this gene was deposited with
the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on
March 4, 1996 under ATCC accession number 98005 (pATeps - A. thaliana).

In addition, lycopene ε-cyclases have been identified in lettuce and in Adonis
palaestina (cDNA #5) which encode enzymes that convert lycopene to the bicyclic
ε-carotene (ε,ε-carotene). An additional cDNA from Adonis palaestina (cDNA #3) encodes a
lycopene ε-cyclase which converts lycopene into δ-carotene (ε,ψ-carotene) and differs from
the lycopene ε-cyclase which forms bicyclic ε-carotene (ε,ε-carotene) by only 5 amino acids.
One or more of these amino acids may be modified by alteration of the nucleotide sequence
in the #5 cDNA to obtain an enzyme which forms the bicyclic ε,ε-carotene. The sequences
of the Adonis palaestina and Arabidopsis thaliana ε-cyclases have about 70% nucleotide
identity and about 72% amino acid identity.

Initial experiments by the inventors with chimeric genes indicated that the part of the
ε-cyclase which is responsible for adding 2 ε rings to form ε,ε-carotene is the carboxy
terminal portion of the gene. The lettuce ε-cyclase adds two ε rings to form ε,ε-carotene. A
DNA encoding a partial potato ε-cyclase (missing its amino terminal portion), when
combined with an amino terminal region from the lettuce ε-cyclase gene, produces a
monocyclic δ-carotene (ε,ψ-carotene). With the discovery of the differences between the
Adonis palaestina clone #3 and clone #5, the specific amino acids responsible for the addition
of an extra ε ring have been identified (Figure 24). Specifically, amino acid 55 is Thr in
clone #3 and Ser in clone #5, amino acid 210 is Asn in clone #3 and Asp in clone #5, amino
acid 231 is Asp in clone #3 and Glu in clone #5, amino acid 352 is Ile in clone #3 and Val in
clone #5, and amino acid 524 is Lys in clone #3 and Arg in clone #5. It can be appreciated
that these changes are quite conservative, as only one change, at amino acid 210, changes the
charge of the protein.
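
As an illustrative check of the statement that only the change at amino acid 210 alters the charge of the protein, the five listed differences between clone #3 and clone #5 can be tabulated as in the Python sketch below. The nominal charge table is a simplifying assumption of this sketch (Asp and Glu taken as negative, Lys and Arg as positive, all other residues as neutral), and the names `DIFFERENCES`, `CHARGE` and `charge_changing` are hypothetical.

```python
# Residue differences between Adonis palaestina clone #3 and clone #5 as
# listed in the text: (position, residue in clone #3, residue in clone #5).
DIFFERENCES = [
    (55, "Thr", "Ser"),
    (210, "Asn", "Asp"),
    (231, "Asp", "Glu"),
    (352, "Ile", "Val"),
    (524, "Lys", "Arg"),
]

# Nominal side-chain charge near neutral pH (assumption of this sketch;
# residues not listed here are treated as uncharged).
CHARGE = {"Asp": -1, "Glu": -1, "Lys": +1, "Arg": +1}


def charge_changing(differences):
    """Return the positions at which the two residues differ in nominal charge."""
    return [
        position
        for position, residue_3, residue_5 in differences
        if CHARGE.get(residue_3, 0) != CHARGE.get(residue_5, 0)
    ]


print(charge_changing(DIFFERENCES))  # [210]
```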

Thus, it is clear that the nucleic acids of the invention encoding the enzymes as
presently disclosed may be altered to increase a particularly desirable property of the enzyme,
to change a property of the enzyme, or to diminish an undesirable property of the enzyme.
Such modifications can be by deletion, substitution, or insertion of one or more amino acids,
and can be performed by routine enzymatic manipulation of the nucleic acid encoding the
enzyme (such as by restriction enzyme digestion, removal of nucleotides by mung bean
nuclease or Bal31, insertion of nucleotides by Klenow fragment, and by religation of the
ends), by site-directed mutagenesis, or may be accidental, such as by low fidelity PCR or
those obtained through mutations in hosts that are producers of the enzymes. These
techniques as well as other suitable techniques are well known in the art.

Mutations can be made in the nucleic acids of the invention such that a particular
codon is changed to a codon which codes for a different amino acid. Such a mutation is
generally made by making the fewest nucleotide changes possible. A substitution mutation
of this sort can be made to change an amino acid in the resulting protein in a
non-conservative manner (i.e., by changing the codon from an amino acid belonging to a grouping
of amino acids having a particular size or characteristic to an amino acid belonging to another
grouping) or in a conservative manner (i.e., by changing the codon from an amino acid
belonging to a grouping of amino acids having a particular size or characteristic to an amino
acid belonging to the same grouping). Such a conservative change generally leads to less
change in the structure and function of the resulting protein. A non-conservative change is
more likely to alter the structure, activity or function of the resulting protein. The present
invention should be considered to include sequences containing conservative changes which
do not significantly alter the activity or binding characteristics of the resulting protein.
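
The principle of changing a codon by the fewest nucleotide changes possible can be illustrated, purely as a sketch, by the following Python code. It assumes the standard genetic code written with DNA bases, uses one-letter amino acid codes, and ignores codon usage in the host; the names `CODON_TABLE` and `closest_codons` are hypothetical and do not correspond to any procedure described by the inventors.

```python
from itertools import product

# Standard genetic code, with bases in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(triplet): aa for triplet, aa in zip(product(BASES, repeat=3), AMINO)
}


def closest_codons(codon, target_aa):
    """Return (n_changes, codons): the codons encoding `target_aa` (one-letter
    code) that differ from `codon` at the fewest nucleotide positions."""
    codon = codon.upper()
    if codon not in CODON_TABLE:
        raise ValueError(f"not a DNA codon: {codon!r}")
    candidates = [c for c, aa in CODON_TABLE.items() if aa == target_aa]
    if not candidates:
        raise ValueError(f"unknown amino acid code: {target_aa!r}")

    def distance(other):
        return sum(a != b for a, b in zip(codon, other))

    best = min(distance(c) for c in candidates)
    return best, sorted(c for c in candidates if distance(c) == best)


# Asn (AAT) to Asp needs a single change; Lys (AAA) to Arg likewise.
print(closest_codons("AAT", "D"))
print(closest_codons("AAA", "R"))
```

In practice the choice among equally close codons might also take into account codon preference in the intended host, which this sketch does not attempt.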
The following is one example of various groupings of amino acids:

Amino acids with nonpolar R groups: Alanine, Valine, Leucine, Isoleucine, Proline,
Phenylalanine, Tryptophan and Methionine.

Amino acids with uncharged polar R groups: Glycine, Serine, Threonine, Cysteine, Tyrosine,
Asparagine and Glutamine.

Amino acids with charged polar R groups (negatively charged at pH 6.0): Aspartic acid and
Glutamic acid.

Basic amino acids (positively charged at pH 6.0): Lysine, Arginine and Histidine.

Another grouping may be those amino acids with phenyl groups: Phenylalanine,
Tryptophan and Tyrosine.

Another grouping may be according to molecular weight (i.e., size of R groups).

Particularly preferred substitutions are:

- Lys for Arg and vice versa such that a positive charge may be maintained;

- Glu for Asp and vice versa such that a negative charge may be maintained;

- Ser for Thr such that a free -OH can be maintained; and

- Gln for Asn such that a free NH2 can be maintained.
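
For illustration, the first four groupings given above (nonpolar, uncharged polar, negatively charged and basic) can be encoded so that a proposed substitution is classed as conservative when both residues fall in the same grouping. This is a minimal Python sketch under that single assumption; the names `GROUPS`, `GROUP_OF` and `is_conservative` are hypothetical, and the alternative groupings (phenyl groups, molecular weight) are not modelled.

```python
# The four groupings, using three-letter residue codes.
GROUPS = {
    "nonpolar": {"Ala", "Val", "Leu", "Ile", "Pro", "Phe", "Trp", "Met"},
    "uncharged polar": {"Gly", "Ser", "Thr", "Cys", "Tyr", "Asn", "Gln"},
    "acidic": {"Asp", "Glu"},
    "basic": {"Lys", "Arg", "His"},
}

# Reverse lookup: residue -> name of its grouping.
GROUP_OF = {
    residue: name for name, members in GROUPS.items() for residue in members
}


def is_conservative(original, replacement):
    """Return True when both residues belong to the same grouping."""
    return GROUP_OF[original] == GROUP_OF[replacement]


# The particularly preferred substitutions all stay within one grouping.
for pair in [("Lys", "Arg"), ("Glu", "Asp"), ("Ser", "Thr"), ("Gln", "Asn")]:
    print(pair, is_conservative(*pair))
```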

Amino acid substitutions may also be introduced to substitute an amino acid with a
particularly preferable property. For example, a Cys may be introduced to provide a potential
site for disulfide bridges with another Cys. A His may be introduced as a particularly
"catalytic" site (i.e., His can act as an acid or base and is the most common amino acid in
biochemical catalysis). Pro may be introduced because of its particularly planar structure,
which induces β-turns in the protein's structure.

It is clear that certain modifications of SEQ ID NOS: 2, 4, 14-21, 23 or 25-27 can take
place without destroying the activity of the enzyme. It is noted especially that truncated
versions of the nucleic acids of the invention are functional. For example, several amino
acids (from 1 to about 120) can be deleted from the N-terminus of the lycopene ε-cyclases of
the invention, and a functional protein can still be produced. This fact is made especially
clear from Figure 25, which shows a sequence alignment of several plant ε-cyclases. As can
be seen from Figure 25, there is an enormous amount of sequence disparity between amino
acid sequences 2 to about 50-70 (depending on the particular sequence, since gaps are
present). There is less, but also a substantial amount of, sequence dissimilarity between about
50-70 to about 90-120 (depending on the particular sequence). Thereafter, the sequences are
fairly conserved, except for small pockets of dissimilarity between about 275-295 to about
285-305 (depending on the particular sequence), and between about 395-415 to about
410-430 (depending on the particular sequence).

The present inventors have found that the amount of the 5' region present in the
nucleic acids of the invention can alter the activity of the enzyme. Instead of diminishing
activity, truncating the 5' region of the nucleic acids of the invention may result in an enzyme
with a different specificity. Thus, the present invention relates to nucleic acids and enzymes
encoded thereby which are truncated to within 0-50, preferably 0-25, codons of the 5'
initiation codon of their prokaryotic counterparts as determined by alignment maps as
discussed below.

For example, when the cDNA encoding A. thaliana β-carotene hydroxylase was
truncated, the resulting enzyme catalyzed the formation of β-cryptoxanthin as the major
product and zeaxanthin as minor product; in contrast to its normal production of zeaxanthin.

The present invention is intended to include those nucleic acid and amino acid
sequences in which substitutions, deletions, additions or other modifications have taken
place, as compared to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27, without destroying the activity
of the enzyme. Preferably, the substitutions, deletions, additions or other modifications take
place at the 5' end, or any other of those positions which already show dissimilarity between
any of the presently disclosed amino acid sequences (see also Figure 25) or other amino acid
sequences which are known in the art and which encode the same enzyme (i.e., lycopene
ε-cyclase, IPP isomerase or β-carotene hydroxylase).
In each case, nucleic acid and amino acid sequence similarity and identity are measured
using sequence analysis software, for example, the Sequence Analysis, Gap, or BestFit
software packages of the Genetics Computer Group (University of Wisconsin Biotechnology
Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAlign (DNAStar, Inc.,
1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105
S. Bascom Avenue, Suite 200, Campbell, California 95008). Such software uses algorithms
to match similar sequences by assigning degrees of identity to various substitutions,
deletions, and other modifications, and includes detailed instructions as to useful parameters,
etc., such that those of routine skill in the art can easily compare sequence similarities and
identities. An example of a useful algorithm in this regard is the algorithm of Needleman and
Wunsch, which is used in the Gap program discussed above. This program finds the
alignment of two complete sequences that maximizes the number of matches and minimizes
the number of gaps. Another useful algorithm is the algorithm of Smith and Waterman,
which is used in the BestFit program discussed above. This program creates an optimal
alignment of the best segment of similarity between two sequences. Optimal alignments are
found by inserting gaps to maximize the number of matches using the local homology
algorithm of Smith and Waterman.
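
As a rough illustration of the difference between the global (Needleman and Wunsch) and
local (Smith and Waterman) approaches, the following minimal Python sketch computes only the
optimal alignment scores. The function names and the scoring values (match +1, mismatch -1,
gap -1) are assumptions chosen for illustration; they are not the parameters or the code of
the Gap or BestFit programs.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Optimal global alignment score of two complete sequences (illustrative scoring)."""
    # prev[j] is the best score for aligning the prefix of a processed so far with b[:j]
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix costs i gaps
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def smith_waterman_score(a, b, match=1, mismatch=-1, gap=-1):
    """Optimal local alignment score: the best-scoring segment pair (illustrative scoring)."""
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below zero
            cell = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best
```

The global score penalizes every unmatched position over the full length of both sequences,
whereas the local score reports only the best segment of similarity, which is why the two
approaches can rank the same pair of sequences differently.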
Conservative (i.e. similar) substitutions typically include substitutions within the
following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid, glutamic
acid, asparagine and glutamine; serine and threonine; lysine and arginine; and phenylalanine
and tyrosine. Substitutions may also be made on the basis of conserved hydrophobicity or
hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132 (1982)), or on the basis of
the ability to assume similar polypeptide secondary structure (see Chou and Fasman, Adv.
Enzymol. 47: 45-148 (1978)).
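
The substitution groups listed above can be expressed as a simple lookup. The following
Python sketch is illustrative only: the function name and the three-way classification are
assumptions, and only the group-based rule is encoded (not the hydrophobicity or
secondary-structure criteria).

```python
# One-letter codes for the groups named in the text
CONSERVATIVE_GROUPS = [
    {"G", "A"},            # glycine, alanine
    {"V", "I", "L"},       # valine, isoleucine, leucine
    {"D", "E", "N", "Q"},  # aspartic acid, glutamic acid, asparagine, glutamine
    {"S", "T"},            # serine, threonine
    {"K", "R"},            # lysine, arginine
    {"F", "Y"},            # phenylalanine, tyrosine
]


def classify_substitution(original, replacement):
    """Return 'identical', 'conservative' or 'non-conservative' for a residue pair."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return "identical"
    if any(a in group and b in group for group in CONSERVATIVE_GROUPS):
        return "conservative"
    return "non-conservative"
```

For instance, a lysine-to-arginine change falls within one group and is reported as
conservative, while glycine-to-tryptophan is not.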

If comparison is made between nucleotide sequences, preferably the length of
comparison sequences is at least 50 nucleotides, more preferably at least 60 nucleotides, at
least 75 nucleotides or at least 100 nucleotides. It is most preferred if comparison is made
between the nucleic acid sequences encoding the enzyme coding regions necessary for
enzyme activity. If comparison is made between amino acid sequences, preferably the length
of comparison is at least 20 amino acids, more preferably at least 30 amino acids, at least 40
amino acids or at least 50 amino acids. It is most preferred if comparison is made between
the amino acid sequences in the enzyme coding regions necessary for enzyme activity.
It should be appreciated that also within the scope of the present invention are nucleic
acid sequences encoding lycopene ε-cyclases, IPP isomerases and β-carotene hydroxylases
which code for enzymes having the same amino acid sequence as SEQ ID NOS: 2, 4, 14-21,
23 or 25-27, but which are degenerate to the nucleic acids specifically disclosed herein.
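
Degeneracy can be illustrated with a short Python sketch that translates two different
nucleotide sequences using the standard genetic code and checks that they encode the same
amino acids. The helper names and the two 12-nucleotide sequences are hypothetical examples
written for this illustration; they are not taken from the SEQ ID NOS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; '*' marks a stop codon
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def are_degenerate(dna_a, dna_b):
    """True if the sequences differ at the nucleotide level but encode the same protein."""
    return dna_a.upper() != dna_b.upper() and translate(dna_a) == translate(dna_b)


# Hypothetical example: both sequences encode Met-Glu-Cys-Val
sense = "ATGGAGTGTGTT"
variant = "ATGGAATGCGTG"
print(translate(sense), translate(variant), are_degenerate(sense, variant))
```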

The amino acid residues described herein are preferred to be in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino acid residue, 
as long as the desired functional property of immunoglobulin-binding is retained by the 
polypeptide. 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the 
art. Such techniques are explained fully in the literature. See, e.g., Sambrook et al.,
"Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular
Biology" Volumes I-III [Ausubel, R. M., ed. (1994)]; "Cell Biology: A Laboratory
Handbook" Volumes I-III [J. E. Celis, ed. (1994)]; "Current Protocols in Immunology"
Volumes I-III [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M.J. Gait ed. 1984);
"Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; "Transcription And
Translation" [B.D. Hames & S.J. Higgins, eds. (1984)]; "Animal Cell Culture" [R.I.
Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A
Practical Guide To Molecular Cloning" (1984).

The present invention also includes vectors. Suitable vectors according to the present 
invention comprise a nucleic acid of the invention encoding an enzyme involved in 
carotenoid biosynthesis or metabolism and a suitable promoter for the host, and can be 
constructed using techniques well known in the art (for example Sambrook et al., Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY,
1989; Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing and Wiley
Interscience, New York, 1991). Suitable vectors for eukaryotic expression in plants are
described in Frey et al., Plant J. (1995) 8(5):693 and Misawa et al., 1994a, incorporated herein
by reference. Suitable vectors for prokaryotic expression include pACYC184, pUC119, and
pBR322 (available from New England BioLabs, Beverly, MA) and pTrcHis (Invitrogen) and
pET28 (Novagen) and derivatives thereof. The vectors of the present invention can
additionally contain regulatory elements such as promoters, repressors, selectable markers 
such as antibiotic resistance genes, etc. 

The nucleic acids encoding the carotenoid enzymes as described above, when cloned
into a suitable expression vector, can be used to overexpress these enzymes in a plant
expression system or to inhibit the expression of these enzymes. For example, a vector
containing the gene encoding lycopene ε-cyclase can be used to increase the amount of
α-carotene and carotenoids derived from α-carotene (such as lutein and α-cryptoxanthin) in an
organism and thereby alter the nutritional value, pharmacology and visual appearance value
of the organism.

Therefore, the present invention includes a method of producing or enhancing the
production of a carotenoid in a host cell, relative to an untransformed host cell, the method
comprising inserting into the host cell a vector comprising a heterologous nucleic acid
sequence which encodes for a protein having lycopene ε-cyclase, IPP isomerase or
β-carotene hydroxylase enzyme activity, wherein the heterologous nucleic acid sequence is
operably linked to a promoter; and expressing the heterologous nucleic acid sequence to
produce the protein.

The present invention also includes a method of modifying the production of
carotenoids in a host cell, the method comprising inserting into the host cell a vector
comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for
a protein which modifies lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid
sequence is operably linked to a promoter; and expressing the heterologous nucleic acid
sequence in the host cell to modify the production of the carotenoids in the host cell, relative
to the untransformed host cell.

The term "modifying the production" means that the amount of carotenoids produced
in the host cell can be enhanced, reduced, or left the same, as compared to the untransformed
host cell. In accordance with one embodiment of the present invention, the make-up of the
carotenoids (i.e., the specific carotenoids produced) is changed vis-a-vis each other, and this
change in make-up may result in either a net gain, net loss, or no net change in the total
amount of carotenoids produced in the cell. In accordance with another embodiment of the
present invention, the production or the biochemical activity of the carotenoids (or the
enzymes which catalyze their formation) is enhanced by the insertion of an enzyme-encoding
nucleic acid of the invention. In yet another embodiment of the invention, the production or
the biochemical activity of the carotenoids (or the enzymes which catalyze their formation)
may be reduced or inhibited by a number of different approaches available to those skilled in
the art, including but not limited to such methodologies or approaches as antisense (e.g.,
Gray et al. (1992) Plant Mol. Biol. 19:69-87), ribozymes (e.g., Wegener et al. (1994) Mol.
Gen. Genet. 245:465-470), co-suppression (e.g., Fray and Grierson (1993) Plant Mol. Biol.
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. (1997) Plant J.
11:1195-1206), intracellular antibodies (e.g., Rondon and Marasco (1997) Ann. Rev. Microbiol.
51:257-283) or whatever other approaches rely on the knowledge or availability of the nucleic
acid or amino acid sequences of the invention and/or portions thereof, to thereby reduce
accumulation of carotenoids with ε rings and compounds derived from them (for ε-cyclase
inhibition), or carotenoids with hydroxylated β rings and compounds derived from them (for
β-hydroxylase inhibition), or, in the case of IPP isomerase, accumulation of any isoprenoid
compound.

Preferably, at least a portion of the nucleic acid sequences used in the methods,
vectors and host cells of the invention codes for an enzyme having an amino acid sequence
which is at least 85% identical, preferably at least 90%, at least 95% or completely identical
to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27. Sequence identity is determined as noted above.
Preferably, sequence additions, deletions or other modifications are made as indicated above,
so as to not affect the function of the particular enzyme.
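
The identity thresholds above (85%, 90%, 95%) can be checked once two sequences have been
aligned. The Python sketch below is a simplified illustration: it assumes the alignment has
already been produced by one of the programs mentioned earlier, uses "-" as the gap
character, and counts identity over aligned (non-gap) columns only. Other conventions for the
denominator exist, and the function names and test sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identical residues over aligned columns, ignoring columns with a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a, aligned_b, threshold=85.0):
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 20-residue example with two mismatches (18 of 20 identical)
print(percent_identity("MECVGARNFAAMAVSTFPSW", "MECVGTRNFATMAVSTFPSW"))
```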

In a preferred embodiment, vectors are manufactured which contain a DNA encoding
a eukaryotic IPP isomerase upstream of a DNA encoding a second eukaryotic carotenoid
enzyme. The inventors have discovered that inclusion of an IPP isomerase gene increases the
supply of substrate for the carotenoid pathway, thereby enhancing the production of
carotenoid endproducts, as compared to a host cell which is not transformed with such a
vector. This is apparent from the much deeper pigmentation in carotenoid-accumulating
colonies of E. coli which also contain one of the aforementioned IPP isomerase genes when
compared to colonies that lack this additional IPP isomerase gene. Similarly, a vector
comprising an IPP isomerase gene can be used to enhance production of any secondary
metabolite of dimethylallyl pyrophosphate and/or isopentenyl pyrophosphate (such as
isoprenoids, steroids, carotenoids, etc.). The term "isoprenoid" is intended to mean any
member of the class of naturally occurring compounds whose carbon skeletons are composed,
in part or entirely, of isopentyl C5 units. Preferably, the carbon skeleton is of an essential oil,
a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel.

A vector containing the cDNA encoding a lycopene ε-cyclase of the invention,
preferably the lettuce lycopene ε-cyclase or Adonis ε-cyclase #5, can be used to increase the
amount of bicyclic ε-carotene in an organism and thereby alter the nutritional value,
pharmacology and visual appearance value of the organism. In addition, the transformed
organism can be used in the formulation of therapeutic agents, for example in the treatment of
cancer (see Mayne et al. (1996) FASEB J. 10:690-701; Tsushima et al. (1995) Biol. Pharm.
Bull. 18:227-233).

An antisense strand of a nucleic acid of the invention can be inserted into a vector.
For example, the lycopene ε-cyclase gene can be inserted into a vector and incorporated into
the genomic DNA of a host, thereby inhibiting the synthesis of ε,β-carotenoids (lutein and
α-carotene) and enhancing the synthesis of β,β-carotenoids (zeaxanthin and β-carotene).
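
The antisense strand of a sequence is its reverse complement. The following minimal Python
sketch shows the operation on a DNA string; the function name and the nine-nucleotide input
are hypothetical and serve only to illustrate the relationship between sense and antisense
strands, not any particular construct of the invention.

```python
# Translation table mapping each base to its Watson-Crick complement (case preserved)
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the antisense (reverse-complement) strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


sense = "ATGGAGTGT"           # hypothetical sense-strand fragment
antisense = reverse_complement(sense)
print(antisense)              # ACACTCCAT
```

Applying the function twice returns the original sequence, which is a convenient sanity
check when preparing antisense inserts in silico.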

The present invention also relates to novel enzymes which are encoded by the amino
acid sequences of the invention, or portions thereof.

The present invention also relates to novel enzymes which can transform known
carotenoids into novel or uncommon products. Currently ε-carotene (see Figure 2) and
γ-carotene are commonly produced only in minor amounts. As described below, an enzyme
can be produced which transforms lycopene to γ-carotene and lycopene to ε-carotene. With
these products in hand, bulk synthesis of other carotenoids derived from them is possible.
For example, ε-carotene can be hydroxylated to form lactucaxanthin, an isomer of lutein (one
ε and one β ring) and zeaxanthin (two β rings) in which both endgroups are, instead, ε rings.

In addition to novel enzymes produced by truncating the 5' region of known enzymes,
as discussed above, novel enzymes which can participate in the formation of unusual
carotenoids can be formed by replacing portions of one gene with an analogous sequence
from a structurally related gene. For example, β-cyclase and ε-cyclase are structurally
related (see Figure 13). By replacing a portion of lycopene β-cyclase with the analogous
portion of ε-cyclase, an enzyme which produces γ-carotene will be produced (one β
endgroup). Further, by replacing a portion of the lycopene ε-cyclase with the analogous
portion of β-cyclase, an enzyme which produces ε-carotene will be produced (with some
exceptions, such as the lettuce ε-cyclase, plant ε-cyclases normally produce a compound with
one ε-endgroup, δ-carotene). Similarly, β-hydroxylase could be modified to produce
enzymes of novel function by creation of hybrids with ε-hydroxylase.

Host systems according to the present invention can comprise any organism that
already produces carotenoids or which has been genetically modified to produce carotenoids.
The IPP isomerase genes are more broadly applicable for enhancing production of any
product dependent on DMAPP and/or IPP as a precursor.

Organisms which already produce carotenoids include plants, algae, some yeasts,
fungi and cyanobacteria and other photosynthetic bacteria. Transformation of these hosts
with vectors according to the present invention can be done using standard techniques such as
those described in Misawa et al., (1990) supra; Hundle et al., (1993) supra; Hundle et al.,
(1991) supra; Misawa et al., (1991) supra; Sandmann et al., supra; and Schnurr et al., supra.

Transgenic organisms can be constructed which include the nucleic acid sequences of
the present invention (Bird et al., 1991; Bramley et al., 1992; Misawa et al., 1994a; Misawa et
al., 1994b; Cunningham et al., 1993). The incorporation of these sequences can allow the
controlling of carotenoid biosynthesis, content, or composition in the host cell. These
transgenic systems can be constructed to incorporate sequences which allow for the
overexpression of the nucleic acids of the present invention. Transgenic systems can also be
constructed containing antisense expression of the nucleic acid sequences of the present
invention. Such antisense expression would result in the accumulation of the substrates of the
enzyme encoded by the sense strand.

A method for screening for eukaryotic genes which encode enzymes involved in
carotenoid biosynthesis comprises transforming a prokaryotic host with a nucleic acid which
may contain a eukaryotic or prokaryotic carotenoid biosynthetic gene; culturing said
transformed host to obtain colonies; and screening for colonies exhibiting a different color
than colonies of the untransformed host.

Suitable hosts include E. coli, cyanobacteria such as Synechococcus and
Synechocystis, algae and plant cells. E. coli is preferred.

In a preferred embodiment, the above "color complementation" screening protocol can
be enhanced by using mutants which are either (1) deficient in at least one carotenoid
biosynthetic gene or (2) overexpress at least one carotenoid biosynthetic gene. In either case,
such mutants will accumulate carotenoid precursors.

Prokaryotic and eukaryotic DNA or cDNA libraries can be screened in total for the
presence of genes of carotenoid biosynthesis, metabolism and degradation. Preferred
organisms to be screened include photosynthetic organisms.

E. coli can be transformed with these eukaryotic cDNA libraries using conventional
methods such as those described in Sambrook et al., 1989 and according to protocols
described by the vendors of the cloning vectors.

For example, the cDNA libraries in bacteriophage vectors such as lambdaZAP
(Stratagene) or lambda ZIPLOX (Gibco BRL) can be excised en masse and used to transform
E. coli.

Transformed E. coli can be cultured using conventional techniques. The culture broth
preferably contains antibiotics to select and maintain plasmids. Suitable antibiotics include
penicillin, ampicillin, chloramphenicol, etc. Culturing is typically conducted at 15-40°C,
preferably at room temperature or slightly above (18-28°C), for 12 hours to 7 days.

Cultures are plated and the plates are screened visually for colonies with a different
color than the colonies of the host E. coli transformed with the empty plasmid cloning vector.
For example, E. coli transformed with the plasmid pAC-BETA (described below) produce
yellow colonies that accumulate β-carotene. After transformation with a cDNA library,
colonies which have a different hue than those formed by E. coli/pAC-BETA would be
expected to contain enzymes which modify the structure or accumulation of β-carotene.
Similar E. coli strains can be engineered which accumulate earlier products in carotenoid
biosynthesis, such as lycopene, γ-carotene, etc.

Having generally described this invention, a further understanding can be obtained by
reference to certain specific examples which are provided herein for purposes of illustration
only and are not intended to be limiting unless otherwise specified.

EXAMPLE 

I. Isolation of β-carotene hydroxylase
Plasmid Construction

An 8.6 kb BglII fragment containing the carotenoid biosynthetic genes of Erwinia
herbicola was first cloned in the BamHI site of plasmid vector pACYC184 (chloramphenicol
resistant), and then a 1.1 kb BamHI fragment containing the E. herbicola β-carotene
hydroxylase (CrtZ) was deleted. E. coli strains containing the resulting plasmid, pAC-BETA,
accumulate β-carotene and form yellow colonies (Cunningham et al., 1994).

A full length cDNA encoding IPP isomerase of Haematococcus pluvialis (HP04) was
first excised with BamHI and KpnI from pBluescript SK-, and then ligated into the
corresponding sites of the pTrcHisA vector with high-level expression from the trc promoter
(Invitrogen, Inc.). A fragment containing the IPP isomerase and trc promoter was
subsequently excised with EcoRV and KpnI, treated with the Klenow fragment of DNA
polymerase to produce blunt ends, and ligated in the Klenow-treated HindIII site of
pAC-BETA. E. coli cells transformed with this new plasmid pAC-BETA-04 form orange colonies
on LB plates (vs. yellow for those containing pAC-BETA) and cultures accumulate
substantially more β-carotene (ca. two fold) than those that contain pAC-BETA.

Screening of an Arabidopsis cDNA Library

Several λ cDNA expression libraries of Arabidopsis were obtained from the
Arabidopsis Biological Resource Center (Ohio State University, Columbus, OH) (Kieber et
al., 1993). The λ cDNA libraries were excised in vivo using Stratagene's ExAssist SOLR
system to produce a phagemid cDNA library wherein each phagemid also contained a gene
conferring resistance to the antibiotic ampicillin.

E. coli strain DH10BZIP was chosen as the host cell for the screening and pigment
production, although we have also used TOP10 F' and XL1-Blue for this purpose. DH10B
cells were transformed with plasmid pAC-BETA-04 and were plated on LB agar plates
containing chloramphenicol at 50 µg/ml (from United States Biochemical Corporation). The
phagemid Arabidopsis cDNA library was then introduced into DH10B cells already
containing pAC-BETA-04. Transformed cells containing both pAC-BETA-04 and
Arabidopsis cDNA library phagemids were selected on chloramphenicol plus ampicillin (150
µg/ml) agar plates. Maximum color development occurred after 3 to 7 days incubation at
room temperature, and the rare bright yellow colonies were selected from a background of
many thousands of orange colonies on each agar plate. Selected colonies were inoculated
into 3 ml liquid LB medium containing ampicillin and chloramphenicol, and cultures were
incubated at room temperature for 1-2 days, with shaking. Cells were then harvested by
centrifugation and extracted with acetone in microfuge tubes. After centrifugation, the
pigmented extract was spotted onto silica gel thin-layer chromatography (TLC) plates, and
developed with a hexane:ether (1:1, by volume) mobile phase. β-carotene hydroxylase-encoding
cDNAs were identified based on the appearance of a yellow pigment that co-migrated
with zeaxanthin on the TLC plates.

Subcloning and Sequencing

The plasmid containing the β-carotene hydroxylase cDNA was recovered and
analyzed by standard procedures (Sambrook et al., 1989). The Arabidopsis β-carotene
hydroxylase was sequenced completely on both strands on an automatic sequencer (Applied
Biosystems, Model 373A, Version 2.0.1S). The cDNA insert of 0.95 kb also was excised
and ligated into a pTrcHis vector. A BglII restriction site within the cDNA was used to
remove that portion of the cDNA that encodes the predicted polypeptide N terminal sequence
region that is not also found in bacterial β-carotene hydroxylases (Figure 6). A BglII-XhoI
fragment was directionally cloned in BamHI-XhoI digested pTrcHis vectors.

Pigment Analysis

A single colony was used to inoculate 50 ml of LB containing ampicillin and
chloramphenicol in a 250-ml flask. Cultures were incubated at 28°C for 36 hours with gentle
shaking, and then harvested at 5000 rpm in an SS-34 rotor. The cells were washed once with
distilled H2O and resuspended with 0.5 ml of water. The extraction procedures and HPLC
were essentially as described previously (Cunningham et al., 1994).

II. Isolation and biochemical analysis of an Arabidopsis lycopene ε-cyclase
Plasmid Construction

Construction of plasmids pAC-LYC, pAC-NEUR, and pAC-ZETA is described in
Cunningham et al. (1994). In brief, the appropriate carotenoid biosynthetic genes from
Erwinia herbicola, Rhodobacter capsulatus, and Synechococcus sp. strain PCC7942 were
cloned in the plasmid vector pACYC184 (New England BioLabs, Beverly, MA). Cultures of
E. coli containing the plasmids pAC-ZETA, pAC-NEUR, and pAC-LYC accumulate
ζ-carotene, neurosporene, and lycopene, respectively. The plasmid pAC-BETA was
constructed as follows: an 8.6-kb BglII fragment containing the carotenoid biosynthetic genes
of E. herbicola (GenBank M87280; Hundle et al., 1991) was obtained after partial digestion
of plasmid pPL376 (Perry et al., 1986; Tuveson et al., 1986) and cloned in the BamHI site of
pACYC184 to give the plasmid pAC-EHER. Deletion of adjacent 0.8- and 1.1-kb BamHI-BamHI
fragments (deletion Z in Cunningham et al., 1994), and of a 1.1-kb SalI-SalI fragment
(deletion X) served to remove most of the coding regions for the E. herbicola β-carotene
hydroxylase (crtZ gene) and zeaxanthin glucosyltransferase (crtX gene), respectively. The
resulting plasmid, pAC-BETA, retains functional genes for geranylgeranyl pyrophosphate
synthase (crtE), phytoene synthase (crtB), phytoene desaturase (crtI), and lycopene cyclase
(crtY). Cells of E. coli containing this plasmid form yellow colonies and accumulate
β-carotene. A plasmid containing both the lycopene ε- and β-cyclase cDNAs of A. thaliana
was constructed by excising the ε-cyclase in clone y2 as a PvuI-PvuII fragment and ligating
this piece in the SnaBI site of a plasmid (pSPORT 1 from GIBCO-BRL) that already
contained the β-cyclase (Cunningham et al., 1996).



Organisms and Growth Conditions 

E. coli strains TOP10 and TOP10 F' (obtained from Invitrogen Corporation, San
Diego, CA) and XL1-Blue (Stratagene) were grown in Luria-Bertani (LB) medium
(Sambrook et al., 1989) at 37°C in darkness on a platform shaker at 225 cycles per min.
Media components were from Difco (yeast extract and tryptone) or Sigma (NaCl).
Ampicillin at 150 µg/mL and/or chloramphenicol at 50 µg/mL (both from United States
Biochemical Corporation) were used, as appropriate, for selection and maintenance of
plasmids.
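
The working concentrations given above can be converted into volumes of antibiotic stock
with a simple dilution calculation. The Python sketch below is illustrative only: the
function name, the 500 mL batch size and the stock concentrations (100 mg/mL ampicillin,
34 mg/mL chloramphenicol) are assumptions, not values stated in this specification, and the
small volume contributed by the stock itself is ignored.

```python
def stock_volume_ul(final_ug_per_ml, medium_ml, stock_mg_per_ml):
    """Microlitres of stock needed to bring medium_ml of medium to final_ug_per_ml."""
    if stock_mg_per_ml <= 0:
        raise ValueError("stock concentration must be positive")
    total_ug = final_ug_per_ml * medium_ml
    # 1 mg/mL is the same as 1 ug/uL, so ug divided by mg/mL gives uL
    return total_ug / stock_mg_per_ml


# Assumed stocks and batch size, for illustration only
ampicillin_ul = stock_volume_ul(150, 500, 100)       # 150 ug/mL final in 500 mL
chloramphenicol_ul = stock_volume_ul(50, 500, 34)    # 50 ug/mL final in 500 mL
print(round(ampicillin_ul, 1), round(chloramphenicol_ul, 1))
```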



Mass Excision and Color Complementation Screening of an A. thaliana cDNA Library

A size-fractionated 1-2 kb cDNA library of A. thaliana in lambda ZAPII (Kieber et
al., 1993) was obtained from the Arabidopsis Biological Resource Center at The Ohio State
University (stock number CD4-14). Other size-fractionated libraries were also obtained
(stock numbers CD4-13, CD4-15, and CD4-16). An aliquot of each library was treated to
cause a mass excision of the cDNAs and thereby produce a phagemid library according to the
instructions provided by the supplier of the cloning vector (Stratagene; E. coli strain
XL1-Blue and the helper phage R408 were used). The titre of the excised phagemid was
determined and the library was introduced into a lycopene-accumulating strain of E. coli
TOP10 F' (this strain contained the plasmid pAC-LYC) by incubation of the phagemid with
the E. coli cells for 15 min at 37°C. Cells had been grown overnight at 30°C in LB medium
supplemented with 2% (w/v) maltose and 10 mM MgSO4 (final concentration), and harvested
in 1.5 ml microfuge tubes at a setting of 3 on an Eppendorf microfuge (5415C) for 10 min.
The pellets were resuspended in 10 mM MgSO4 to a volume equal to one-half that of the
initial culture volume. Transformants were spread on large (150 mm diameter) LB agar petri
plates containing antibiotics to provide for selection of cDNA clones (ampicillin) and
maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming units
were spread on each plate. Petri plates were incubated at 37°C for 16 hr and then at room
temperature for 2 to 7 days to allow maximum color development. Plates were screened
visually with the aid of an illuminated 3x magnifier and a low power stage-dissecting
microscope for the rare, pale pinkish-yellow to deep-yellow colonies that could be observed
in the background of pink colonies. A colony color of yellow or pinkish-yellow was taken as
presumptive evidence of a cyclization activity. These yellow colonies were collected with
sterile toothpicks and used to inoculate 3 ml of LB medium in culture tubes with overnight
growth at 37°C and shaking at 225 cycles/min. Cultures were split into two aliquots in
microfuge tubes and harvested by centrifugation at a setting of 5 in an Eppendorf 5415C
microfuge. After discarding the liquid, one pellet was frozen for later purification of plasmid
DNA. To the second pellet was added 1.5 ml EtOH, and the pellet was resuspended by
vortex mixing, and extraction was allowed to proceed in the dark for 15-30 min with
occasional remixing. Insoluble materials were pelleted by centrifugation at maximum speed
for 10 min in a microfuge. Absorption spectra of the supernatant fluids were recorded from
350-550 nm with a Perkin Elmer lambda six spectrophotometer.

Analysis of isolated clones 
Eight of the yellow colonies contained β-carotene, indicating that a single gene
product catalyzes both cyclizations required to form the two β endgroups of the symmetrical
β-carotene from the symmetrical precursor lycopene. One of the yellow colonies contained a
pigment with the spectrum characteristic of δ-carotene, a monocyclic carotenoid with a single
ε endgroup. Unlike the β-cyclase, this ε-cyclase appears unable to carry out a second
cyclization at the other end of the molecule.

The observation that ε-cyclase is unable to form two cyclic ε-endgroups (e.g. the
bicyclic ε-carotene) illuminates the mechanism by which plants can coordinate and control
the flow of substrate into carotenoids derived from β-carotene versus those derived from
α-carotene and also can prevent the formation of carotenoids with two ε endgroups.

The availability of the A. thaliana gene encoding the ε-cyclase enables the directed
manipulation of plant and algal species for modification of carotenoid content and
composition. Through inactivation of the ε-cyclase, whether at the gene level by deletion of
the gene or by insertional inactivation, or by reduction of the amount of enzyme formed (such
as by antisense technology), one may increase the formation of β-carotene and other
pigments derived from it. Since vitamin A is derived only from carotenoids with β
endgroups, an enhancement of the production of β-carotene versus α-carotene may enhance
nutritional value of crop plants. Reduction of carotenoids with ε-endgroups may also be of
value in modifying the color properties of crop plants and specific tissues of these plants.
Alternatively, where production of α-carotene, or pigments such as lutein that are derived
from α-carotene, is desirable, whether for the color properties, nutritional value or other
reason, one may overexpress the ε-cyclase or express it in specific tissues. Wherever
agronomic value of a crop is related to pigmentation provided by carotenoid pigments the
directed manipulation of expression of the ε-cyclase gene and/or production of the enzyme
may be of commercial value.

The predicted amino acid sequence of the A. thaliana ε-cyclase enzyme was
determined. A comparison of the amino acid sequences of the β- and ε-cyclase enzymes of
Arabidopsis thaliana (Fig. 13) as predicted by the DNA sequence of the respective cDNAs
(Fig. 4 for the ε-cyclase cDNA sequence), indicates that these two enzymes have many
regions of sequence similarity, but they are only about 37% identical overall at the amino acid
level. The degree of sequence identity at the DNA base level, only about 50%, is sufficiently
low that we and others have been unable to detect this gene by hybridization using the
β-cyclase as a probe in DNA gel blot experiments.

We claim:

1. An isolated and/or purified nucleic acid sequence which encodes for a protein having
lycopene ε-cyclase enzyme activity and has an amino acid sequence which is at least 85%
identical to one of SEQ ID NOS: 23 or 25-27.

2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid
sequence of one of SEQ ID NOS: 23 or 25-27.

3. A vector comprising the nucleic acid sequence of claim 1, wherein the nucleic acid
sequence is operably linked to a promoter.

4. A host cell which contains the vector of claim 3.

5. The host cell of claim 4, wherein the host cell is selected from the group consisting of
a bacterial cell, an algal cell, a yeast cell and a plant cell.

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell.

7. An isolated and/or purified protein having lycopene ε-cyclase enzyme activity and
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23 or
25-27.

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of
SEQ ID NOS: 23 or 25-27. 
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AMENDED CLAIMS 

[received by the International Bureau on 15 November 1999 (15.11.99);
original claims 1, 2, 7 and 8 amended; remaining claims unchanged (1 page)]

1. An isolated and/or purified nucleic acid sequence which encodes for a protein having
lycopene ε-cyclase enzyme activity and has an amino acid sequence which is at least 85%
identical to one of SEQ ID NOS: 23, 25 or 26.

2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid
sequence of one of SEQ ID NOS: 23, 25 or 26.

3. A vector comprising the nucleic acid sequence of claim 1, wherein the nucleic acid
sequence is operably linked to a promoter. 

4. A host cell which contains the vector of claim 3. 

5. The host cell of claim 4, wherein the host cell is selected from the group consisting of
a bacterial cell, an algal cell, a yeast cell and a plant cell. 

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell. 

7. An isolated and/or purified protein having lycopene ε-cyclase enzyme activity and
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23, 25
or 26.

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of 
SEQ ID NOS: 23, 25 or 26. 
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F I G. 4 A 

Arabidopsis thaliana epsilon cyclase: 

acaaaaggaaataattag attcct'ctttctgcttgctataccttgaca 48 

gaacaacataacaatggtgtaagtcttctc gctgtattcgaaattatttggaggaggaac 108 

- atggagtgtgttggggctaggaatttcgca gcaatggcggtttcaacatttccgtcatgg 168 
1 M EC VTA RN FA TM A V S T F P S W 

agttqtcqaaggaaatttccagtggctaag agatacagctataggaatattcgcttcggt 228 
21 S C R RK F P \ni K RYSYRNlRFb 

ttgt^ta^tgtcagagcta^cggcggcgga a^ttccggta^tgaga^tt^tgtagcggtg 288 

aqagaagatttcgctgacgaagaagatttt gcgaaagctggcggttctgagattctattt 348 
61 R S D F VD E E D F V E -A ^ ^ S R I L F 

gttcaaatgcagcagaacaaagatatggat gaacagtctaagcttgttgataagttgcct 408 
81 VQMQQMKDMD SQSKLVDKL'P 



101 
121 



cctatatcaactggtgatggtgctttggat catgtggttactgqctqtgqtcctgctggt 468 
PISllaDlaALD KVVIGCGPAG 

ttagccttggctgcagaatcagctaagctt ggattaaaagttggactcattggtccagat 528 
L R C% A K S 7i K L ^ L K V ^ L I ^ P D 

cttccttttactaacaattacggtgtttgg gaagatgaattcaatgatcttgggctgcaa 588 
141LPFTMMYGVM KDKFNDLGLG 

aaatgtattgagcatgtttggagagagact attgcgcacctggatgatgacaagcctatt 648 
161 K C I XK V W RT T IVYLDDDKPI 

accattggccgtgrtta^^ 708 
aq^gtgtcgagtcaggtgtctcgtacctt a^ctcgaaagttgacag^cataacagaagct 768 

tgtgatggccttagacttgttgcttgtgac gacaataacgtcattccctgcaggcttgcc 828 
221 S D la L XL V A C D DMMVIPCXLA 

actgttgcttctggagcagcttcgggaaag rtcttgcaatacgaagttggtggacctaga 888 
7 gtctgtgcgcaaartgcatacggcgtggag gttgaggcggaaaatagtccatatgatcca 948 
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FIG. 4B 

261 VCVQTAYGVX VXVXNSPYDP 

qatcaaatggttttcatggattacagagat tatactaacgagaaagttcggagcttagaa 1006 
281L)QMVPMDYRD YTMXXVRSLX 

qctgagtatccaacgtttctgtacgccatg cctatgacaaagtcaagactcttcttcgag 1068 
^ICYPTFLYAM PMTKSRLFFK 

qaqacatqtttggcctcaaaagatgtcatg .ccctttgatttgctaaaaacgaagctcatg 1128 
XTCLASKDVM PFDLLKTKLM 

ttaaqattaqacacactcggaattcgaatt ctaaagacttacgaagaggagtggtcctat 1188 
I P Vb'G STP NT XQKNLAFGAA 

atcccagttggtggttccttgccaaacacc gaacaaaagaatctcgcctttggtgctgcc 1248 
361 I P V^'^ S L P M T XQKNLAFGAA 

gc^aq^tggUcatcccg^^gg^tat tcagttgtgaqatctttgtctgaagctcca 1308 
aaacatgcatcagtcatcgcagagatacta agagaagagactaccaaacagattaacagt 1368 

kyTsviakil reettkqins 

aatatttcaagacaagcttaggatacttta tggccaccagaaaqqaaaagacagaqagca 1428 
MISRQAWDTL WPPERXRQRA 

■ * * " 

ttctttctctttggtcttgcactcagagtt caattcgataccgaaggcattagaagcttc 1488 
441 FFLFbLALlV QFDTXGIRSF 

ttccgtactttcttccgccttccaaaatgg atgtggcaagggtttctaggatcaacatta 1548 
FRTPFRLPKW MWQGFLGSTL 

acatcaqgagatctcgttctctttgcttta tacatgttcgtcatttcaccaaacaatttg 1608 
T S ITd LVLF^L YMPVISPMML 

aqaaaaqqtctcattaatcatctcatctct gatccaaccggagcaaccatgataaaaacc 1668 

tatctcaaagtatgatttacttaccaactc ttaggtttgtgtatatatatgccgatttat 1728 
521 Y L K V 

ctgaataatcgatcaaagaatggtatgtgg gttactaggaagttggaaacaaacacgtat 1788 

agaatctaaggagtgatcgaaatggagacg gaaacgaaaagaaaaaaatcagtctttgtt 1848 

ccgtggctagtg 1S68 



381 
401 
421 



461 
481 
501 
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FIG. 5 

1 gctctttctc ctcctcctct accgatttcc gactccgcct cccgaaatcc
51 ttatccggat tctctccgtc tcttcgattt aaacgctttt ctgtctgtta
101 cgtcgtcgaa gaacggagac agaattctcc gattgagaac gatgagagac
151 cggagagcac gagctccaca aacgctatag acgctgagta tctggcgttg
201 cgtttggcgg agaaattgga gaggaagaaa tcggagaggt ccacttatct
251 aatcgctgct atgttgtcga gctttggtat cacttctatg gctgttatgg
301 ctgtttacta cagattctct tggcaaatgg agggaggtga gatctcaatg
351 ttggaaatgt ttggtacatt tgctctctct gttggtgctg ctgttggtat
401 ggaattctgg gcaagatggg ctcatagagc tctgtggcac gcttctctat
451 ggaatatgca tgagtcacat cacaaaccaa gagaaggacc gtttgagcta
501 aacgatgttt ttgctatagt gaacgctggt ccagcgattg gtctcctctc
551 ttatggattc ttcaataaag gactcgttcc tggtctctgc tttggcgccg
601 ggttaggcat aacggtgttt ggaatcgcct acatgtttgt ccacgatggt
651 ctcgtgcaca agcgtttccc tgtaggtccc atcgccgacg tcccttacct
701 ccgaaaggtc gccgccgctc accagctaca tcacacagac aagttcaatg
751 gtgtaccata tggactgttt cttggaccca aggaattgga agaagttgga
801 ggaaatgaag agttagataa ggagattagt cggagaatca aatcatacaa
851 aaaggcctcg ggctccgggt cgagttcgag ttcttgactt taaacaagtt
901 ttaaatccca aattcttttt ttgtcttctg tcattatgat catcttaaga
951 cggtct
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FIG. 7 



1 




gcctccccgt 
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10 1 

X W X 
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501 


aaggcacctt 


ctgatgggaa 


atggggagag 


cacgaag-ttg 


actatctact 
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FIG. 8 

1 cectgte tgtttcttct ttatttaatc tccc,tt,.t tc g cctca ga 
51 "tctcgctc tttcgtcttc tttttcttct ttecGATTTG. CCCATCCTCC 
101 TCTGTCATCG ATTTCACCGA GAAAGTTACC GAATTTTCGT GCTTTCTCTG
151 GTACCGCTAT GACAGATACT AAAGATGCTG GTATGGATGC TGTTCAGACA
201 CGTCTCATGT TTGAGGATGA ATGCATTCTT GTTGATGAAA CTGATCGTGT
251 TGTGGGGCAT GTCAGCAAGT ATAATTGTCA TCTGATGGAA AATATTGAAG
301 CCAAGAATTT GCTGCACAGG GCTTTTAGTG TATTTTTATT CAACTCGAAG
351 TATGAGTTGC TTCTCCAGCA AAGGTCAAAC ACAAAGGTTA CGTTCCCTCT
401 AGTGTGGACT AACACTTGTT GCAGCCATCC TCTTTACCGT GAATCAGAGG
451 TTATCCAGGA CAATGCACTA GGTGTGAGGA ATGCTGCACA AAGAAAGCTT
501 CTCGATGAGC TTGGTATTGT AGCTGAAGAT GTACCAGTCG ATGAGTTCAC
551 TCCCTTGGGA CGTATGCTGT ACAAGGCTCG TTCTGATGGC AAATCGGGAG
601 AGCATGAACT TCATTACTTG CTCTTCATCG TGCGAGACGT GAAGCTTCAA
651 CCAAACCCAG ATGAAGTAGC TGAGATCAAC TATGTGAGCC GGGAAGAGCT
701 GAAGGAGCTG GTGAAGAAAG CAGATGCAGG TGAGGAAGGT TTGAAACTGT
751 CACCATGGTT CAGATTCCTG GTGGACAATT TCTTCATGAA GTGGTGGGAT
801 CATGTTGAGA AAGGAACTTT GGTTGAAGCT ATAGACATGA AAACCATCCA
851 CAAACTCTGA ACATCTTTTT TTAAAGTTTT TAAATCAATC AACTTTCTCT
901 TCATGATTTT TATCTTTTCG ATGATAATAA TTTGGGATAT GTGAGACACT
951 TACAAAACTT CCAAGCACCT CAGGCAATAA TAAAGTTTGC GGCCGC
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CTCGGTACCT 


GGCCACAATC 
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GATGCCGCGA 


TGCTTCGTTC 
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GCCCAGCXGC 


151 


AGCTCAGGAG 


CATGCAGATG 
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CTGTCGCGCG 


CCGAGGACCG 
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CGCAGGCCGG 


CAGTCCCAGG. 
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TGGTGGATGT 


TGAGGACAAC 


351 


CACAAGTTCC 


T A C CA CAT C A 


401 


TGTGTTCCTG 


TTTGACGATC 


4 51 


GCTCAAAAAT 


CACCTTCCCA 








«J -J X 


CGACCGAACA 


GTACCTGCCG 


QUI 






651 


CTCACGCCTT 


TCCACTACTG 


701 


" ATCAGCGCTC 


TGGGGCG^GC 


7 51 


CCAACGTCAC 


CTTCCCGCCC 


801 


GTGACGCAAG 


AGGACCTGCG 
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ATGGTCGCCG 


TGGTTTCGCA 


901 . 


GGGCTGACCT 


GGXCGCGGCC 


951 


ACGGTGCATd 


ACATCAACGA 
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CACACGTCAT 


GGGGTGGAAT 


1051 


TCTGAGACTG 


AACCTGCAGT 


1101 


GATAAAATGT 


ACCGTCACTT 


1151 


AAAAAAAAAA 


AAAAA 
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TCATCGCCGC 


GCGCTTCCTT 


GAGCGTTCGT 


CTAAACACTG . 


ACAAACACGA 


GGATTGGGGA 


AGCGTGAAAG 


CAGAAGCTGC 


AGGATGTGAA 


TGCGTACTTG 


GCAGCTTCGT 


ATCTCC1TTT 


CAGGTCCCAC 


AAGGTCAGGT 


AAAATGGCTC 


TTTGTCGCGT 


ATACTGAACT 


CCAAGAGGTC 
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FIG. 10 



1 CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC
51 GATGCCGCGA TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCGCG
101 CGTCAACTCC GCCCXGCXGC CCAGCTGTGC ACACGCGCGA CTCCACTTTA
151 ACCTCAGGAC GATGCAGCTG CTTTCCGAGG ACCCCACAGA CCACATGAGG
201 GGTGCAAGCA CCTGGGCAGG CGGGGAGTCG CAGGATGAGC TGATGCTGAA
251 GGACGAGTGC ATCTTGGTAG ATGTTGAGGA CAACATCACA GGCCATGtCA
301 GCAAGCTGGA GTGTCACAAG TTCCTACCAC ATCAGCCTGC AGGCCTGCTG
351 CACCGGGCCJT TCTCTGTGTT CCTGTTTGAC GATCAGGGGC GACTGCTGCT
401 GCAACAGCGT CCACCCTCAA AAATCACCTT CCCAAGTCTC TGGACGAACA
451 CCTGCTGCAG CCACCCTTTA CATGGGCAGA CCCCAGATGA GGTGGACCAA
501 CTAAGCCAGG TGGCCGACGG AACAGTACCT GGCGCAAAGG CTGCTGCCAT
551 CCGCAAGTTG GAGCACGAGC TGGGGATACC AGCGCACCAG CTGCCGGCAA
601 GCGCGTTTCG CTTCCTCACG CGTTTGCACT ACTGTGCCGC GGACGTGCAG
651 CCAG.CTtfCGA CACAATCAGC GCTCTTGGGGC GAGCACGAAA TGCACTACAT
701 CTTGTTCATC CGGGCCXXCG TCACCTTGGC GCCCAACCCT GACGAGGTGG
751 ACGAAGTCAG GTACGTGACG CAAGAGGAGC TGCGGCAGAT GATGCAGCCG
801 GACAACGGGC TTCAATGCTC GCCCTGGTTT CGCATCATCG CCGCGCGCTT
851 CCTTGAGCGT TGGTGGGCTG ACCTGGACGC GGCCCTAAAC ACTGACAAAC
901 ACGAGGATTG GGGAACGGTG CATCACATCA ACGAAGCGTG AAGGCAGAAG
951 CTGCAGGATG TGAAGACACG TCATGGGGTG GAATTGCGTA CTTGGCAGCT
1001 TCGTATCTCC TTTTTCTGAG ACTGAACCTG CAGAGCTAGA GTCAATGGTG
1051 CATCATATTC ATCGTCTCTC TTTTcrrrrA GACTAATCTG TAGCTAGAGT
1101 CACTGATGAA TCCTTTACAA CTTTCAAAAA AAAAA
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FIG. I I A 



1 50
HP04      MLRSLLRGLT HIPRVNSAQQ PSCAHARLQF KLRSMQMTLM QPSISANLSR
HP05      MLRSLLRGLT HIPRVNSAQQ PSCAHARLQF KLRSMQLL
ATDP7     MSVSSLFNLP .LIRLRSLA. LSSSFSSFRF AHRPLSSIS. PRKLPNFRAF
C.brew.   MS.SSMLNFT .ASRIVSLPL LSSPPSRVHL PLCFFSPISL TQRFSAKLTF
AT0P5     TGPPPRFFP IRSPVPRTQL FVRAFSAV
S.cerev.  ..MTADNNSM PHGAVSSYAK LVQNQTPEDI LEEFPEIIPL QQRPN...TR.

51 100
HP04      AEDRTDHMRG ASTWAGGQSQ DELMLKDECI LVDVEDNITG HASKLECHKF
HP05      SEDRTDHMRG ASTWAGGQSQ DELMLKDECI LVDVEDNITG HASKLECHKF
ATDP7     S..GTA.MTD TKDAGMDAVQ RRLMFEDECI LVDETDRVVG HVSKYNCHLM
C.brew.   SSQATT.MGE VVDAGMDAVQ RRLMFEDECI LVDENDKVVG HESKYNCHLM
AT0P5     T.MTD SNDAGMDAVQ RRLMFEDECI LVDENNRVVG HDTKYNCHLM
S.cerev.  SSETSNDESG ETCFSGHDEE QIKLMNENCI VLDWDDNAIG AGTKKVCHLM

101 150
HP04      LPHQPAGLLH RAFSVFLFDD QGRLLLQQRA RSKITFPSVW TNTCCSHPLH
HP05      LPHQPAGLLH RAFSVFLFDD QGRLLLQQRA RSKITFPSVW TNTCCSHPLH
ATDP7     ENIEAKNLLH RAFSVFLFNS KYELLLQQRS NTKVTFPLVW TNTCCSHPLY
C.brew.   ENIESENLLH RAFSVFLFNS KYELLLQQRS ATKVTFPLVW TNTCCSHPLY
AT0P5     EKIEAENLLH RAFSVFLFNS KYELLLQQRS KTKVTFPLVW TNTCCSHPLY
S.cerev.  ENIE.KGLLH RAFSVFIFNE QGELLLQQRA TEKITFPDLW TNTCCSHPLC

151 200
HP04      GQTPDEVDQL SQVADGTVPG AKAAAIRKLE HELGIPAHQL PA.SAFRFLT
HP05      GQTPDEVDQL SQVADGTVPG AKAAAIRKLE HELGIPAHQL PA.SAFRFLT
ATDP7     RE SELIQDNALG VRNAAQRKLL DELGIVAEDV PV.DEFTPLG
C.brew.   RE SELIDENCLG VRNAAQRKLL DELGIPAEDL PV.DQFIPLS
AT0P5     RE SELIEENVLG VRNAAQRKLF DELGIVAEDV PV.DEFTPLG
S.cerev.  ID...DELGL KGKLDDKIKG AITAAVRKLD HELGIPEDET KTRGKFHFLN

201 250
HP04      RLHYCAADVQ PAATQSALWG EHEMDYILFI ..RANVTL APNPDEVDEV
HP05      RLHYCAADVQ PAATQSALWG EHEMDYILFI ..RANVTL APNPDEVDEV
ATDP7     RMLY.. KAPSDGKWG EHELDYLLFI ..VRDVKV QPNPDEVAEI
C.brew.   RILY KAPSDGKWG EHELDYLLFI ..IRDVNL DPNPDEVAEV
AT0P5     RMLY KAPSDGKWG EHEVDYLLFI ..VRDVKL QPNPDEVAEI
S.cerev.  RIHY MAPSNEPWG EHEIDYILFY KINAKENLTV NPNVNEVRDF
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FIG. MB 

251 300
RYVTQEELRQ MMQ....PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD
RYVTQEELRQ MMQ....PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD
KYVSREELKE LVKKADAGEE GLKLSPWFRL VVDNFLMKWW DHVEKGTLVE
KYMNRDDLKE LLRKADAEEE GVKLSPWFRL VVDNFLFKWW DHVEKGSLKD
KYVSREELKE LVKKADAGDE AVKLSPWFRL VVDNFLMKWW DHVEKGTITE
KWVSPNDLKT MF ADP SYKFTPWFKI ICENYLFNWW EQLDDLSEVE

301

KHEDWGTVHH INEA* 

KHEDWGTVHH INEA* 

A.IDMKTIHK L* 

A.ADMKTIHK L* 

A.ADMKTIHK L* 

A.ADMKTIHK L* 

NDRQ...IHR ML* 



WO 99/63055 PCT/US99/12121


FIG. 12 



1 ccaaaaacaa ctcaaatctc cticcgrcgcH cttac-tccgc catgggrgac
51 cractccGcca tggatgc-tg-t tcagcgacgt: cticatgttitg acgatgaatg
101 cattttacrtc gatgag-Cgtg acaatg-t:gg-t: gggacatgat accaaataca
151 attctcactt gatggagaag attgaaacag gtaaaatgct gcacag^gca
201 ttcagcgttt            ttcaaaatac gagttacttc ttcagcaacg
251 ctlctlcrcA act aaggtgacat titicctrttiag-t atggaccaac acctgttgca
301            ctacagagaa tccgagctriig ttcccgaaac gcctgagaga
351            gaggaxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx
401            xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx
451 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx
501 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx
551 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx
601 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx
651 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx tcatgtgcaa aagggtacac
701 tcactgaatg caatttgata tgaaaaccat acacaagctg atatagaaac
751 acaccctcaa ccgaaaagca agcctaataa tticggg^tgg g-tcgggtcta
801 ccatcaattg            ttaacaactt ttaatctcta tttgagcatg
851 ttgattcttg tctctirgtgt gtiaagattitr gggtttcgtt tcagrtgtaa
901 taatgaacca ttgatggrtt: gcaatttcaa gttcctatcg acatgragrg
951 atctaaaaaa











[FIG. 13: alignment of the predicted amino acid sequences of the Arabidopsis thaliana β- and ε-cyclase enzymes; the content of the figure sheets is not recoverable from the source text]


FIG. 14A

Adonis palaestina ε-cyclase cDNA #5 Length: 1898

1 aaaggagtgt tctattaatg ttactgtcgc attcttgcaa cacttatatt
51 caaactccat tttcttcttt tttcttcaaa acaacaaact aatgtgagca
101 gagtatctgg ctatggaact acttggtgtt cgcaacctca tctcttcttg
151 ccctgtgtgg acttttggaa caagaaacct tagtagttca aaactagctt
201 ataacataca tcgatatggt tcttcttgta gagtagattt tcaagtgaga
251 gctgatggtg gaagcgggag tagaagttct gttgcttata aagagggttt
301 tgtggatgaa gaggatttta tcaaagctgg tggttctgag cttttgtttg
351 tccaaatgca gcaaacaaag tctatggaga aacaggccaa gctcgccgat
401 aagttgccac caataccttt tggagaatcc gtgatggact tggttgtaat
451 aggttgtgga cctgctggtc tttcactggc tgcagaagct gctaagctag
501 ggttgaaagt tggccttatt ggtcctgatc ttccttttac aaataattat
551 ggtgtgtggg aagacgagtt caaagatctt ggacttgaac gttgtatcga
601 gcatgcttgg aaggacacca tcgtatatct tgataatgat gctcctgtcc
651 ttattggtcg tgcatatgga cgagttagtc gacatttgct acatgaggag
701 ttgctgaaaa ggtgtgtgga gtcaggtgta tcatatctgg attctaaagt
751 ggaaaggatc actgaagctg gtgatggcca tagccttgta gtttgtgaaa
801 atgagatctt tatcccttgc aggcttgcta ctgttgcatc tggagcagct
851 tcagggaaac ttttggagta tgaagtaggt ggccctcgtg tttgtgtcca
901 aaccgcttat ggggtggagg ttgaggtgga gaacaatcca tacgatccca
951 acttaatggt attcatggac tacagagact atatgcaaca gaaattacag
1001 tgctcggaag aagaatatcc aacatttctC tatgtcatgc ccatgtcgcc
1051 aacaagactt ttttttgagg aaacctgttt ggcctcaaaa gatgccatgc
1101 cattcgatct actgaagaga aaactgatgt cacgattgaa gactctgggt
1151 atccaagtta caaaagttta tgaagaggaa tggtcatata ttcctgttgg
1201 tggttcttta ccaaacacag agcaaaagaa cctagcattt ggtgctgcag
1251 caagcatggt gcatccagca acaggctatt cggttgtacg gtcactgtca
1301 gaagctccaa aatatgcttc tgtaattgca aagattttga agcaagataa
1351 ctctgcgtat gtggtttctg gacaaagtag tgcagtaaac atttcaatgc
1401 aagcatggag cagtctttgg ccaaaggagc gaaaacgtca aagagcatTc
1451 tttcttttTg gattagagct tattgtgcag ctagatattg aagcaaccag
1501 aacattcttt agaaccttct tccgcttgcc aacttggatg tggtggggtt
1551 tccttgggtc ttcactatca tctttcgatc tcgtcttgtt ttccatgtac
1601 atgtttgttt tggcgccaaa cagcatgagg atgtcacttg tgagacattt
1651 gctttcagat ccttctggtg cagttatggt aagagcttac ctcgaaaggt
1701 agtctcatct attattaaac tctagtgttt caccaaataa atgaggatcc
1751 ttcgaatgtg tatatgatca tctctatgta tatcctgtac tctaatctca
1801 taaagtaaat gccgggtttg atattgttgt gtcaaaccgg ccaatgatat
1851 aaagtaaatt tattgataca aaagtagttt ttttccttaa aaaaaaaa
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FIG. I4B 



Adonis palaestina ε-cyclase #5 predicted polypeptide
TRANSLATE from: 113 to: 1702 Length: 529 amino acids

1 MELLGVRNLI SSCPVWTFGT RNLSSSKLAY NIHRYGSSCR VDFQVRADGG
51 SGSRSSVAYK EGFVDEEDFI KAGGSELLFV QMQQTKSMEK QAKLADKLPP
101 IPFGESVMDL VVIGCGPAGL SLAAEAAKLG LKVGLIGPDL PFTNNYGVWE
151 DEFKDLGLER CIEHAWKDTI VYLDNDAPVL IGRAYGRVSR HLLHEELLKR
201 CVESGVSYLD SKVERITEAG DGHSLVVCEN EIFIPCRLAT VASGAASGKL
251 LEYEVGGPRV CVQTAYGVEV EVENNPYDPN LMVFMDYRDY MQQKLQCSEE
301 EYPTFLYVMP MSPTRLFFEE TCLASKDAMP FDLLKRKLMS RLKTLGIQVT
351 KVYEEEWSYI PVGGSLPNTE QKNLAFGAAA SMVHPATGYS VVRSLSEAPK
401 YASVIAKILK QDNSAYVVSG QSSAVNISMQ AWSSLWPKER KRQRAFFLFG
451 LELIVQLDIE ATRTFFRTFF RLPTWMWWGF LGSSLSSFDL VLFSMYMFVL
501 APNSMRMSLV RHLLSDPSGA VMVRAYLER*
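
The TRANSLATE coordinates given for this polypeptide can be checked with a short sketch. It assumes the standard genetic code and 1-based, inclusive coordinates; the helper names are illustrative only. The 30-base excerpt corresponds to positions 113 to 142 of the cDNA of FIG. 14A.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def orf_codons(start: int, end: int) -> int:
    """Number of codons in a reading frame given 1-based inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("coordinates do not span a whole number of codons")
    return length // 3


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; any trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# 113..1702 spans 530 codons: 529 amino acids plus the stop codon.
print(orf_codons(113, 1702))

# First ten codons of the reading frame, taken from FIG. 14A.
excerpt = "atggaactacttggtgttcgcaacctcatc"
print(translate(excerpt))
```

The codon count of 530 is consistent with the stated length of 529 amino acids followed by a stop codon, and the excerpt translates to the first ten residues listed for the polypeptide.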
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FIG. I5A 

DNA sequence of potato cDNA (GenBank R27545) obtained from Nicholas J. Provart

potato.seq Length: 1378 August 2, 1996 13:06 Type: N Check: 605 

1 tagcggnnnn naggatgagt tcaaagatct tggtcttcaa gcctgcattg 

51 aacatgtttg gcgggatacc attgtatatc ttgatgatga tgatcctatt 

101 cttattggcc gtgcctatgg aagagttagt cgccatttac tgcacgagga 

151 gttactcaaa aggtgtgtgg aggcaggtgt tttgtatcta aactcgaaag

201 tggataggat tgttgaggcc acaaatggcc acagtcttgt agagtgcgag 

251 ggtgatgttg tgattccctg caggtttgtg actgttgcat cgggagcagc 

301 ctcggggaaa ttcttgcagt atgagttggg aggtcctaga gtttctgttc 

351 aaacagctta tggagtggaa gttgaggtcg ataacaatcc atttgacccg 

401 agcctgatgg ttttcatgga ttatagagac tatgtcagac acgacgctca 

451 atctttagaa gctaaatatc caacatttct ctatgccatg cccatgtctc 

501 caacacgagt ctttttcgag gaaacttgtt tggcttcaaa agatgcaatg 

551 ccattcgatc tgttaaagaa aaaattgatg ttacgattga acaccctcgg 

601 tgtaagaatt aaagaaattt atgaggagga atggtcttac ataccagttg 

651 gaggatcttt gccaaataca gaacaaaaaa cacttgcatt tggtgctgct 

701 gctagcatgg ttcatccagc cacaggttat tcagtcgtca gatcactgtc 

751 tgaagctcca aaatgcgcct tcgtgcttgc aaatatatta cgacaaaatc 

801 atagcaagaa tatgcttact agttcaagta ccccgagtat ttcaactcaa 

851 gcttggaaca ctctttggcc acaagaacga aaacgacaaa gatcgttttt 

901 cctatttgga ctggctctga tattgcagct ggatattgag gggataaggt 

951 catttttccg cgcgttcttc cgtgtgccaa aatggatgtg gcagggattt

1001 cttggttcaa gtctttcttn agcagacctc atgttatttg ccttctacat 

1051 gtttattatt gcaccaaatg acatgagaag aggcttaatc agacatcttt 

1101 tatctgatcc tactggtgca acattgataa gaacttatct tacattttag 

1151 agtaaattcc tcctacaata gttgttgaan nagaggcctc attacttcag 

1201 attcataaca gaaatcgcgg tctctcgagg ccttgtatat aacattttca 

1251 ctaggttaat attgcttgaa taagttgcac agtttcagtt tttgtatctg 

1301 cttctttttt gtccaagatc atgtattgan ccaatttata tacattgcca 

1351 gtatatataa attttataaa aaaaaaaa 

poteps.pep Length: 378 TRANSLATE from: 14 to: 1147 

1 DEFKDLGLQA CIEHVWRDTI VYLDDDDPIL IGRAYGRVSR HLLHEELLKR 

51 CVEAGVLYLN SKVDRIVEAT NGHSLVECEG DVVIPCRFVT VASGAASGKF 

101 LQYELGGPRV SVQTAYGVEV EVDNNPFDPS LMVFMDYRDY VRHDAQSLEA 

151 KYPTFLYAMP MSPTRVFFEE TCLASKDAMP FDLLKKKLML RLNTLGVRIK 

201 EIYEEEWSYI PVGGSLPNTE QKTLAFGAAA SMVHPATGYS VVRSLSEAPK

251 CAFVLANILR QNHSKNMLTS SSTPSISTQA WNTLWPQERK RQRSFFLFGL 

301 ALILQLDIEG IRSFFRAFFR VPKWMWQGFL GSSLSXADLM LFAFYMFIIA

351 PNDMRRGLIR HLLSDPTGAT LIRTYLTF* 
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FIG. I5B 

Chimeric lettuce/potato lycopene ε-cyclase: converts lycopene to δ-carotene;
the lettuce cDNA converts lycopene to ε-carotene, and the
potato cDNA does not produce an active enzyme

(amino acids in lower case are from lettuce and those in upper case
are from the potato cDNA; an AvaII site in common to the two cDNAs
was used to construct the chimera)

1 mecfgarnmt atmavftcpt ftdcnirhkf sllkqrrftn lsassslrqi 

51 kcsaksdrcv vdkqgisvac eedyvkaggs elffvqmqrt ksmesqskls 

101 eklaqipign cildlvvigc gpaglalaae saklglnvgl igpdlpftnn 

151 ygvwqdefig lglegciehs wkdtlvyldd adpirigray grvhrdllhe 

201 ellrrcvesg vsylsskver iteapngysl iecegnitip crlatvasga 

251 asgkfleyel gGPRVSVQTA YGVEVEVDNN PFDPSLMVFM DYRDYVRHDA

301 QSLEAKYPTF LYAMPMSPTR VFFEETCLAS KDAMPFDLLK KKLMLRLNTL 

351 GVRIKEIYEE EWSYIPVGGS LPNTEQKTLA FGAAASMVHP ATGYSVVRSL

401 SEAPKCAFVL ANILRQNHSK NMLTSSSTPS ISTQAWNTLW PQERKRQRSF 

451 FLFGLALILQ LDIEGIRSFF RAFFRVPKWM WQGFLGSSLS XADLMLFAFY 

501 MFIIAPNDMR RGLIRHLLSD PTGATLIRTY LTF* 
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FIG, 16 

GAP comparison of Arabidopsis ε-cyclase x potato ε-cyclase (partial)

blosum62.cmp
Gap Weight: 12 Average Match: 2.912
Length Weight: 4 Average Mismatch: -2.003

Quality: 1485 Length: 529
Ratio: 3.929 Gaps: 1
Percent Similarity: 79.893 Percent Identity: 76.139

Match display thresholds for the alignment(s):
| = IDENTITY
: = 2
. = 1



151 EDEFNDLGLQKCIEHVWRETIVYLDDDKPltlGRAYGRVSRRLLHEELLR 200 

Ml lllll llllllhllllllir II IIIIIMIII lllllll: ' 

1 . DEFKDLGLQACI EHVWRDTIVYLDDDDPI LIGRAYGRVSRHLLHEELLK 49 

201 RCVESGVS YLSSKVDS I TEASDGLRLVACDDNNV I PCRLATVASGAASGK' 250 

INI. II II. MM I II.. I II J: • Mill IIIIIMIII 
50 RCVEAGVLYLNSKVDRI VEATNGHSLVECEGDVVI PCRFVTVASGAASGK 99 

251 LLQYEVGGPRVCVQTAYGVEVEVENSPYDPDQMVFMDYRDYTNEKVRSLE 300 

MM. Mill MIIIIMIIhl.hll IIIMIMI -III 

100 FLQYELGGPRVSVQTAYGVEVEVDNNPFDPSLMVEMDYRDYVRHDAQSLE 149 
301 AEYPTfLYAMPMTKSRLFFEETCLASKDVMPFDLLKTKLMLRLDTLGIRi 350 

Ml I'll I Mill. .MINIMUM lllllll llllll.llhll 

150 AKYPTFLYAMPMSPTRVFFEETCLASKDAMPFDLLKKKLMLRLNTLGVRI 199 

351 LKTYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSLSEAP 400 
IMIMIMM 



200 KEI YEEEWSYI PVGGSLPNTEQKTLAFGAAASMVHPATGYSVVRSLSEAP 249 
401 KYASVIAEI LREETTKQI . '. . . . NSNISRQAWDTLWPPERKRQRAFFLFG 445 

I I hi MM -I • Ml MM MM MUM. lllll 

250 KCAFVLANI LRQNHSKNMLTSSSTPSISTQAWNTLWPQERKRQRSFFLFG 299 
446 LALIVQFDTEGIRSFFRTFFRLPKWMWQGFLGSTLTSGDLVLFALYMFVI 495 

HIM I I 1 1 1 1 1 1 i 1 III- IMIMIMM. |. II. Ill IMM 

300 LALILQLDIEGIRSFFRAFFRVPKWMWQGFLGSSLSXADLMLFAFYMFII 349 

496 S PNN LRKG L I NH LI SDPTGA7M I KTY LKV 524 

.||.:|:||| I f r 1 1 1 1 1 1 I r | r | | I 
350 APNDMRRGLIRHLLSDPTGATLIRTYLTF 378 
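
The summary statistics in the header of this comparison can be related to one another with a small worked calculation. This is a sketch under stated assumptions rather than a description of how the alignment program computes its output: it assumes that Ratio is Quality divided by the length of the shorter sequence (378 positions for the potato fragment), and that the percentages are taken over the aligned columns outside the single five-column gap. The residue counts it prints are back-calculated and do not appear in the figure.

```python
# Values copied from the header of the comparison.
QUALITY = 1485
PERCENT_IDENTITY = 76.139
PERCENT_SIMILARITY = 79.893

# Assumptions of this sketch (not stated in the figure):
SHORTER_LENGTH = 378  # positions in the potato fragment
GAP_COLUMNS = 5       # columns occupied by the single gap


def quality_ratio(quality: float, shorter_length: int) -> float:
    """Quality normalised by the length of the shorter sequence."""
    return quality / shorter_length


def implied_count(percent: float, compared: int) -> int:
    """Whole number of columns implied by a percentage of the compared columns."""
    return round(percent * compared / 100.0)


compared = SHORTER_LENGTH - GAP_COLUMNS
ratio = quality_ratio(QUALITY, SHORTER_LENGTH)
identical = implied_count(PERCENT_IDENTITY, compared)
similar = implied_count(PERCENT_SIMILARITY, compared)

print(f"ratio = {ratio:.3f}")
print(f"identical columns = {identical} of {compared}")
print(f"similar columns = {similar} of {compared}")
```

Under these assumptions the reported Ratio and both percentages are reproduced to three decimal places, which suggests, but does not prove, that the denominators chosen here match those used by the program.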
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FIG. I7A 



Adonis palaestina Ipi1

1 attcatcttc agcagcgctg tcgtactctt tctatatctt cttccatcac 

51 taacagtagt cgccgacggt tgaatcggct attcgcctca acgtcaacta 

101 tgggtgaagt cactgatgct ggaatggatg ctgttcagaa gcggctcatg 

151 ttcgacgacg aatgtatttt ggtggatgag aatgacaagg tcgtcgggca 

201 tgattccaaa tacaactgtc atttgatgga aaagatagag gcagaaaatt 

251 tgcttcacag agccttcagt gttttcttgt tcaactcaaa atatgaattg 

301 cttcttcagc aacgatccgc cacaaaggta acattcccgc tcgtatggac 

351 aaacacatgt tgcagtcatc ctctctttcg tgattccgag ctcatagaag 

401 aaaattatct cggtgtacga aacgctgcac aaagaaagct tttagacgag 

451 ctaggcattc cagctgaaga tgtcccagtt gatgaattta ctcctcttgg 

501 tcgcattctt tacaaagctc catctgacgg caaatgggga gagcacgaat 

551 tggactatct cctatttatt gtccgagatg tgaaatacga tccaaaccca 

601 gatgaagttg ctgatgctaa gtatgttaat cgcgaggagt tgagagagat 

651 actgagaaaa gctgatgctg gtgaagaggg actcaagttg tctccttggt 

701 ttagattggt tgttgataac tttttgttca agtggtggga tcatgtagag 

751 cagggtacga ttaaggaagt tgctgacatg aaaactatcc acaagttgac 

801 ttaagaggac ttctctcctc tgttctacta tttgtttttt gctacaataa 

851 gtgggtggtg ataagcagtt tttctgtttt ctttaattta tggcttttga 

901 atttgcctcg atgttgaact tgtaacatat ttagacaaat atgagacctt 

951 gtaagttgaa tttgaggctg aatttatatt tttgggaaca taataatgtt

1001 aa 
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FIG. 17 B 

Adonis palaestina Ipi2 

1 ttttaaagct ctttcgctcc accaccatca aagccagcca aatttctctg

51 tacaaaagtt aaaaacaccg ctttgggctt tggcccctcc atatcggaat 

101 ccttgtttac gatacgcatc taaaccagta attctcggtt ttaatttgtt 

151 tcctaaatta ggcccctttc cggaatcccg agaattatgt cgtcgatcag 

201 gattaatcct ttatatagta tcttctccac caccactaaa aeattatcag 

251 cttcgtgttc ttctcccgct gttcatcttc agcagcgttg tcgtactctt

301 tctatttdtt cttccatcac taacagtcct cgccgagggt tgaatcggct 

351 gttcgcctca acgtcgacta tgggtgaagt cgctgatgct ggtatggatg 

401 ccgtccagaa gcggcttatg ttcgacgatg aatgtatttt ggtggatgag 

451 aatgacaagg tcgtcggaca tgattccaaa tacaactgtc atttgatgga 

501 aaagatagag gcagaaaact tgcttcacag agccttcagt gttttcttat 

551 tcaactcaaa atacgagttg cttcttcagc aacgatctgc aacgaaggta 

601 acattcccgc tcgtatggac aaacacctgt tgcagccatc ccctcttccg 

651 tgattccgaa ctcatagaag aaaattttct cggggtacga aacgctgcac 

701 aaaggaagct tttagacgag ctaggcattc cagctgaaga cgtaccagtt 

751 gatgaattca ctcctcttgg tcgcattctt tacaaagctc catctgacgg 

801 aaaatgggga gagcacgaac tggactatct tctgtttatt gtccgagatg

851 tgaaatacga tccaaaccca gatgaagttg ctgacgctaa gtacgttaat 

901 cgcgaggagt tgaaagagat actgagaaaa gctgatgcag gtgaagaggg 

951 aataaagttg tctccttggt ttagattggt tgtggataac tttttgttca 

1001 agtggtggga tcatgtagag gaggggaaga ttaaggacgt cgccgacatg 

1051 aaaactatcc acaagttgac ttaagagaaa gtctcttaag ttctactatt 

1101 tggtttttgc ttcaataagt ggatggtgat gagcagtttt tatgcttcct 

1151 ttaattttgg cttttcaatt tgctttatgt gttgaacttg taacatattt 

1201 agtcaaatat gagaccttgt gagttgaatt tgaggttata tttatagttt 

1251 tgggaacata aaaaaaaaaa 
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F 1 6: I8A 

H TlSart U ggc«caitc J gctatttgga acctggcccg gcggcagtcc 

51 gltg?cgcga tgcttcgttc gttgctcaga ggcctcacgc atatcccccg 

ini rataaactcc qcccagcagc ccagctgtgc acacgcgcga ctccagttta 

lSi aactcaqgag wtgcigatg acgrtcatgc agcccagcat ctcagccaat 

Im ctotcqcqcg ccglggaccg cacagaccac atgaggggtg caagcacctg 

k\ aacaaqcqqq cagtcgcagg atgagctgat gctgaaggac gagtgcatct 

\m taataqatqt tgiggicaac atcacaggcc atgccagcaa gctggagtgt 

? 5 . SSSttcc tlcEIcatca gcctgcaggc ctgctgcacc gggccttctc 

w\ tqtattcctq tttgacgatc aggggcgact gctgctgcaa cagcgtgcac 

451 gctSaaaat cacctt?cca agtgtgtgga cgaacacctg ctgcagccac 

501 cctttacatg ggcagacccc agatgaggtg gaccaactaa gccaggtggc 

551 cgacggaaca gtacctggcg caaaggctgc tgccatccgc aagttggagc 

601 acqagrtggg gataccagcg caccagctgc cggcaagcgc g«tcgcttc 

65 cffig?? !gcactactg tgccgcggac Qtgcagccag gragaca 

& SSc SS83S S S3 m Sffi 

1 ili ssfe ass SS3BS agg 

Tnm aacacqtcat ggggtggaat tgcgtacttg gcagcttcgt atctcctttt 

i li& SMs sBss sssss ssss 

1151 aaaaaaaaaa aaaaa 
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FIG. I8B 



Haematococcus pluvialis Ipi2

1 tggaacctgg cccggcggca gtccgatgcc gcgatgcttc gttcgttgct 

51 cagaggcctc acgcatatcc cgcgcgtgaa ctccgcccag cagcccagct 

101 gtgcacacgc gcgactccag tttaagctca ggagcatgca gctgcttgcc 

151 gaggaccgca cagaccacat gaggggtgca agcacctggg caggcgggca 

201 gtcgcaggat gagctgatgc tgaaggacga gtgcatctta gtggatgctg

251 acgacaacat cacaggccat gccagcaagc tggagtgcca caaattcCta 

301 ccacatcagc ctgcaggcct gctgcaccgg gccttctctg tgttcctgtt 

351 tgacgaccag gggcgactgc tgctgcaaca gcgtgcacgc tcaaaaatca

401 ccttcccaag tgtgtggacg aacacctgct gcagccaccc tctacatggg 

451 cagaccccag atgaggtgga ccaactaagc caggtggccg acggcacagt 

501 acctggcgca aaagctgctg ccatccgcaa gttggagcac gagctgggga 

551 taccagcgca ccagctgccg gcaagcgcgt ttcgcttcct cacgcgtttg 

601 cactactgtg ccgcggacgt gcagccggct gcgacacaat cagcgctctg 

651 gggcgagcac gagatggact acatcttatt catccgggcc aacgtcacct 

701 tggcgcccaa ccctgacgag gtggacgaag tcaggtacgt gacgcaagag 

751 gagctgcggc agatgatgca gccggacaac gggttgcaat ggtcgccgtg 

801 gtttcgcatc atcgccgcgc gcttccttga gcgttggtgg gctgacctgg 

851 acgcggccct aaacactgac aaacacgagg attggggaac ggtgcatcac 

901 atcaacgaag cgtgaaggca gaagctgcag gatgtgaaga cacgtcatgg 

951 ggtggaattg cgtacttggc agcttcgtat ctcctttttc tgagactgaa 

1001 cctgcagagc tagagtcaat ggtgcatcat attcatcgtc tctcttttgt 

1051 tttagactaa tctgtagcta gagtcactga tgaatccttt acaactttca 

1101 aaaaaaaaa 
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FIG. I9A 

Lactuca sativa Ipi1

1 tgccaaaatg ttgaaatttc ccccttttaa aaccattgct accatgatct 

51 cttctccata ttcttccttc ttgctgcctc ggaaatcttc tttccctcca 

101 atgccgtctc tcgcagccgc tagtgttttc ctccaccctc tttcgtctgc 

151 cgctatgggc gattccagca tggatgctgt ccagcgacgt ctcatgttcg 

201 atgacgaatg cattttggtg gatgagaatg acaaagtggt tggccatgat 

251 actaaataca attgtcattt gatggagaag attgaaaagg gaaatatgct 

301 acacagagca ttcagtgtgt tcttgttcaa ctcgaaatat gaattactcc 

351 ttcagcaacg ttctgcaacc aaggtgactt tccctttggt atggacaaac 

401 acgtgttgca gccatccact atacagggag agtgagctta ttgacgaaaa 

451 cgcccttggg gtgaggaatg ctgcacagag gaagctcctg gatgaactcg 

501 gcatccctgg agcagatgt! ccggttgatg agttcactcc attgggtcgc 

551 attctataca aggccgcatc ggatggaaag tggggagaac atgaacttga 

601 ttacctgctg tttatggtac gtgatgttgg tttggatccg aacccagatg 

651 aagtgaaaga tgtaaaatat gtgaaccggg aagagctgaa ggaattggta 

701 aggaaggcgg atgctggtga agagggtgtg aagctgtccc cgtggttcaa 

751 attgattgtc gataatttct tgtttcagtg gtgggatcga ctccataagg 

801 gaaccctaac cgaagctatt gatatgaaaa caatccacaa jctcacataa 

851 aaacactaca claglaggag jgaggattat atgagatatt tgttatatgt 

901 gaaattgaaa ttcagatgaa tgcttgtatt tatttctatt tggacaaact 

951 tcaacttctt tttgctacct tatcagaaaa aaaaa 

FIG. 19B 

i0Ct \ C tattcgcttc P afaatctctt ccattaactg ctcaaatctc caccttcgcc 

. 51 ggtcttaatc tccgccggcg cactttcacc accataaccg ccgccatgqg 

'101- tgacgattcc ggcatggacg ctgtccagag acgtctcatg tttgatgatg 

151 aatgcatttt ggttgatgaa aafgacaatg ttcttgggca tgafaccaaa 

201 tacaattgtc acttgatgga gaagattgag aaagataatt tgcttcatag 

251 agcattcagt gtatltttat tcaattcaaa atacgaatta ctccttcagc 

301 aaaggtcaga aaccaaggtg acatttcctt tggtatggac aaacacctgt 

351 tgcagccatc cactatacag agaatcggag tlaattcccg aaaatgccct 

401 tggggtcaga aatgctgcac agaggaagct tctaqatgaa ctcggtatcc 

451 clgctgaaga tgttccagtt gatgagttca caactttagg tcgcatgttq 

501 tacaaggctc catctgatgg aaaatgqggt gaacatgaag ttgattacct 

551 actct?cctc gtgcglgacg ttgccgfgaa cccaaaccc! gatgaggtgg 




601 

701 ggtggacaac ?tct!gtfca aatggtgaqa tcatgtccaa aaggggacac 

751 tcaatgaagc aattgacatg aaaaccatlc ataaqttgat atgaaaaatg 

801 qttaatattt atggtggtgg tttgqagcta ataatttgtg tgttcaagtc 

85 ?cggtccttc ttStSaS gttStf ttt tttcttt!}! tgggagtgtt 

901 tattgtgtac ttgtaacgta ggccctttgg ttacgcttta agagtttaat 

951 aaagaaccac cgttaattta aaaaaaaaaa aaaaaaaa 

FIG. 20 

Chlamydomonas reinhardtii Ipi1 

(Note: the isomerase cDNA probably ends at ca. base 1103; the 
second half of the cDNA is similar to extensin and other 
hydroxyproline-rich structural proteins) 

1 ggcacgagct cgagtttgtt ttaccatgac atcgggaatt tggaagcttg 

51 aactacctca attactcaag taactcgcgg caacacattt cgcgcgccat 

101 cgctgttttc tctgctccag ctaccgagca gcattgcttt agatcgcttt 

151 gatgtcataa actcccactt atatgagatc cagtttcatc gagcccaagc 

201 ccagagcgca acctgtctta agccgcggca gggcgtccat gcgcctcgcg 

251 caaagccgtg ctctcgttgc gcgtgtcagc tccgccctgt ggccgggagc 

301 aggactttca caggctcaaa gcgttgcggt gcgaatggcg agttcgtcaa 

351 cctgggaagg cacgggcctg agccaggatg acttcatgca gcgggacgag 

401 tgcttggtgg tggacgagca ggaccggctg ctaggcaccg ccaacaagta 

451 cgactgccac cgcttcgagg cggccaaggg ccagccctgc ggccgcctgc 

501 accgcgcctt ctccgtgttc ctgttcagcc ccgacggccg actgctgctg 

551 cagcagcgcg cagccagcaa ggtgacgttc ccgggtgtgt ggaccaacac 

601 ctgctgctcg cacccgctgg cgggccaggc gccggacgag gtggacctgc 

651 cggcggcggt agcctcgggc caggtgccgg gcatcaaggc ggcggcggtg 

701 cgcaagctgc agcacgagct ggggataccg ccggagcagg ttcccgcctc 

751 ctccttctcc ttcctcacgc gtctgcacta ctgcgccgcc gacaccgcca 

801 cgcacggccc ggcggcggag tggggcgagc acgaggtgga ctacgtgctg 

851 ttcgtgcggc cgcagcagcc cgtcagcctg cagcccaacc cagacgaggt 

901 ggacgccacg cgctacgtga cgctgccgga gcttcagtcc atgatggcgg 

951 accccggcct cagctggagc ccctggttcc gcatcctggc cacacagccc 

1001 gccttcctgc ccgcctggtg gggcgacctg aagcggcgct ggcgcccggg 

1051 cggcagccga ctgtaggact ggggcaccat ccaccgcgtc atgtgaagaa 

1101 aaaggggaag caggggcggg agcgggggat gaatgggaat gtgaatgcga 

1151 ttgtgatgcg gcgtgggatg aggtctgaag acagggggaa aatcgggggg 

1201 cgggcgtgag cgtgtgtgta cgtgagcgac aaagccggga ggcggaccgc 



1401 ggcgggcctc acfcctggtc gtgcccagtg gtctcgtggg cagagtggca 

1451 ggggctgcac ccatatgagc ggcgcactgc cgcgctgggc taagtcctta 

1501 tcacttggtg aggtggggcg aggtggctgt gggcggcggg cgcagtggca 

1551 gaaggacacg gtgtgtgagc ggtggagcfc tggccgtgcc ggccgtgagg 

1601 ggcggatagc gatatgacgt Igtgcttggc cgctg!aatg cgggagaatg 

1651 tgcaggccgc gagaagcggg cggtggcagg aggccgcagg ctgcagcacc 

1701 cgttggggag gtgccgcctg caggcgcggc gccgggcggg cctgagtaat 

1751 gggcgcctga gtagtggcgg ccacaggagg cgcaggaggc agcagcagga 

1801 ggacgagctg gagggacccg ttggcaaccc aaggffgcgc gtgtaacata 

1851 gtggccatac aaaaaaaaaa aaaa 

FIG. 21A 

Tagetes erecta Ipi1 

1 ccaaaaacaa ctcaaatctc ctccgtcgct cttactccgc catgggtgac 

51 gactccggca tggatgctgt tcagcgacgt ctcatgtttg acgatgaatg 

101 cattttggtg gatgagtgtg acaatgtggt gggacatgat accaaataca 

151 attgtcactt gatggagaag attgaaacag gtaaaatgct gcacagagca 

201 ttcagcgttt ttctattcaa ttcaaaatac gagttacttc ttcagcaacg 

251 gtctgcaacc aaggtgacat ttcctttagt atggaccaac acctgttgca 

301 gccatccact ctacagagaa tccgagcttg ttcccgaaaa cgcccttgga 

351 gtaagaaatg ctgcacagag gaagctgttg gatgaactcg gtatccctgc 

401 tgaagatgtt cccgttgatc agtttactcc tttaggtcgc atgctctaca 

451 aggctccatc tgatggaaag tggggagaac atgaacttga ctacctactt 

501 ttcatagtga gagacgttgc tgtaaacccg aacccagatg aagtggcgga 

551 tatcaaatat gtganccang aagagttaaa ggagctgcta aggaaagcag 

601 atgcggggga ggagggtttg aagctgtctc catggttcag gttagtggtt 

651 gataacttct tgttcaagtg gtgggatcat gtgcaaaagg gtacactcac 

701 tgaagcaatt gatatgaaaa ccatacacaa gctgatatag aaacacaccc 

751 tcaaccgaaa agttcaagcc taataattcg ggttgggtcg ggtctaccat 

801 caattgtttt tttcttttaa gaagttttaa tctctatttg agcatgttga 

851 ttcttgtctt ttgtgtgtaa gattttgggt ttcgtttcag ttgtaataat 

901 gaaccattga tggtttgcaa tttcaagttc ctatcgacat gtagtgatct 

951 aaaaaa 

FIG. 21B 

Oryza sativa Ipi1 

1 cctccctttg cctcgcgcag aggcggccgc gccttctccg ccgcgaggat 

51 ggccggcgcc gccgccgccg tggaggacgc cgggatggac gaggtccaga 

101 agcggctcat gttcgacgac gaatgcattt tggtggatga acaagacaat 

151 gttgttggcc atgaatcaaa atataactgc catctgatgg aaaaaatcga 

201 atctgaaaat ctacttcata gggctttcag tgtattcctg ttcaactcaa 

251 aatatgaact cctactccag caacgatctg caacaaaggt tacatttcct 

301 ctagtttgga ccaacacttg ctgcagccat cctctgtacc gtgagtctga 

351 gcttatacag gaaaactacc ttggtgttag aaatgctgct cagaggaagc 

401 tcttggatga gctgggcatc ccagctgaag atgtgccagt tgaccaattc 

451 acccctcttg gtcggatgct ttacaaggcc ccatctgatg gaaaatgggg 

501 tgaacacgag cttgactacc tgctgttcat cgtccgcgac gtgaaggtag 

551 tcccgaaccc ggacgaagtg gccgatgtga aatacgtgag ccgtgagcag 

601 ctgaaggagc tcatccgcaa agcggacgcc ggagaggaag gcctgaagct 

651 gtctccctgg ttccggctgg ttgttgacaa cttcctcatg ggctggtggg 

701 atcacgtcga gaaaggcacc ctcaacgagg ccgtggacat ggagaccatc 

751 cacaagctga agtaaggact gcgatgttgt ggctggaaag aatgatpctg 

801 aagactctgt tcttgtgctg ctgcatatta ctcttaccag ggaagttgca 

851 gaagtcagaa gaagcttttg tatgtttctg ggtttggagc ttggaagtgt 

901 tgggctctgc tgactgagag attcccttat agagtgtcta tgttaattta 

951 gcaaacttct atattataca tgattagtta attgttcggt gtctgaataa 

1001 agaacaatag catgttccat gtttatttgc t 

FIG. 23A 

Comparison using GAP program of the Genetics Computer Group 

Gap Weight: 50 Average match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 17392 Length: 1904 

Ratio: 9.411 Gaps: 3 

Percent Similarity: 95.331 Percent Identity: 95.331 
Match display thresholds for the alignment(s): 

| = IDENTITY : = 5 . = 1 



Adonis palaestina ε-cyclase #3 x Adonis palaestina ε-cyclase #5 

• • • • • 

1 gagagaaaaagagtgttatattaatgttactgtcgcattcttgcaacac. 49 

III TIT) Til I Nil III]] 1 11111111111)1111 llll 

1 aaaqqaqtqttctattaatqttactqtcqcattcttacaa< 



qcaacact 44 



50 .atattcagactccattttcttgttttctcttcaaaacaacaaactaatg 98 

" iiiiini ii i 



45 ta 



Jlllll IlllllllilllJ llll II III 
tattcaaactccattttcttcttttctctt 



:caaaacaacaaactaatg 94 



aactacttggtgttcgcaacctcatctc 147 
ggctatggaactacttggtgttcgcaaccica- xlc 144 



99 tqa . cgqaqtatctaqctatqqaactacttqqtgttcq 

nc ill i iiiiiiii in Mil mi i in mi I JIT 

95 tgagcagagtatctggctatggaactacttggtgttcg 

• • • • • 

148 ttcttgccctgtctggacttttggaacaagaaaccttagtagttcaaaac 197 

,« limimiu iTii ii miiii i in iiiii iiiii iiiiiiii ii in „ 

145 ttcttgccctgtgtggacttttggaacaagaaaccttagtagttcaaaac 194 

• • • • • 

198 tagcttataacatacatcgatatggttcttcttgtagagtagattttcaa 247 

,nc I I'll J J 1 J 1 1 1 1 J 1 1 1 J 111 J I JTTJ:J I J J 1 J JTJ IT1TJ ITI J J J J 1 1 1 " 

195 tagcttataacatacatcgatatggttcttcttgtagagtagattttcaa 244 

• • • • • 

248 qtqaqgqctqatqqtqqaaqcqqqaqtaqaacttctqttqcttataaaqa 297 

o,c TJT IT T M T I J IT ITT 1 1 T I ITT 1 T J I T 1 1 H llll Jllll I) 1 1 ITI _ 

245 gtgagagctgatggtggaagcgggagtagaagttctgttgcttataaaga 294 



298 



295 gggttttgtggatgaagagga 



tatcaaagctggtggttctgagcttt 347 

1 J I JTITI JJl , nn 

ittctgagcttt 344 
aggccaagctc 397 

,« , iiiiiiiiiii _ 

345 tgtttgtccaaatgcagcaaacaaagtctatggagaaacaggccaagctc 394 



111 I ITI1TT... ........... 

atcaaagctggtggttctgagcttt 344 

• • • • • 

348 tgtttgtccaaatgcagcaaacaaagtctatggagaaacaggccaagctc 397 

{T{[p|iniiJTimii iiiiiiii imm iiiinii i iTii. 
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FIG. 23 B 



398 gccgataagt 

one TIITIJIITJ 

395 gecgataagt 



tgccaccaa 

III nun 

tgccaccaa 



• • • 

tacctttcqqagaatctgtgatggacttqgt 447 

'■■jji it mi 1 1 ii I mm i Nil ii 

ttttggagaatccgtgatggacttqqt 



accttttggagaatccgtgatggacttggt 444 



448 tqtaataqqttgtggacctgctggtctttcactg^ 497 
445 igiaataggttgtggacctgctggtctttcactggctgcagaagctgcta 494 
498 aqctaqqcttqaaagttqqccttattqqtcctgatcttccttttacaa 547 

AO c mi JiTT mill j mill] i IT J 1 1 JT 1 1 1 1 1 1 1 J J 11 1 r ■ < « i 

495 agctagggttgaaagttggccttattggtcctgatcttccttttc 



:acaaat 544 

• • • • 

ttatggtgtgtgggaagacgagttcaaagatcttggacttg^ 

=« ■ ■]! I J TTjTjT J TTT I 111 ITITI J 1 1 1 II 1 1 1 1 ITT 1 1 1 IT I . . 

545 aattatggtgtgtgggaagacgagttcaaagatcttggacttgaacgttg 594 



wm 



iiiiiniimmiimm ttttttt mm r 

ittatggtgtgtgggaagacgagttcaaagatcttggacttc 

cnc II . „ ............. .. 

595 tatcgagcatgcttggaaggacaccatcgtatatcttgataatgatgctc 644 

tcgtgcatatggacgagttagtcgacatttqctacat 



598 tatcqaqcatgcttggaaggacaccatcgtatatcttgacaatqatqctc 647 

. I J IT III I1TMJTT I ITTI 1 1 1 1 1 J III I J M Mill 11)1111111 

tatcgagcatgcttggaaggacaccatcgtatatcttgataatgatqctc 



648 ctgtcc 

™ 'I I]' I 
645 ctgtcc 




attgg 



694 



698 qaaqaq ( ttqctqaaaaqgtgtgtcgagtcaggtgtatcatatctgaattc 747 

ffl J TIT1JTI]TIIIITTITITI 1 111 I llljll III III 1 1 IT III I 

695 gaggagttgctgaaaaggtgtgtggagtcaggtgtatcatatctggattc 744 
748 taaagtggaaaggatcactgaagctggtgatggccatagtcttgtagttt 797 

in i Tin 1 1 1 Hi ii 1 1 m iii mm ini 1 1 1 it i iiti itiij 

745 taaagtggaaaggatcactgaagctggtgatggccatagccttgtagttt 794 

• • • • a 

798 gtgaaaacgacatctttatcccttgcaggcttgctactgttgcatctgga 847 

7QC TITIill Tl I J I JIJ I J 1 1 1 1JTI ITTI J JTII I IJTJ 1TI I J I ITT I 

795 gtgaaaatgagatctttatcccttgcaggcttgctactgttgcatctgga 844 

• • • • • 

848 gcagcttcagggaaacttttggagtatgaagtaggtggccctcgtgtttg 897 

q/ic II III II I ITTTIII 1 1 II I IT IT] I III IT) I Til III I II ITITI I IT 

845 gcagcttcagggaaacttttggagtatgaagtaggtggccctcgtgtttg 894 

• • • • , 

898 tgtccaaactgcttatggtgtggaggttgaggtggagaacaatccatacg 947 

onc ITIIIIIII TIIIIITT T I TT I TT I j T I TT J T7 JTIMMJIIMIIT 

895 tgtccaaaccgcttatggggtggaggttgaggtggagaacaatccatacg 944 
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FIG. 23C 



• • • 

948 atcccaacttaatggtatttatggactacagagactatatgcaacagaaa 997 

llllllllllllinill] l|lTll|lllimi|l|l]llllMlMl QQfl 

945 atcccaacttaatggtattcatggactacagagactatatgcaacagaaa 994 



• • • - - 

998 ttacagtgctcggaagaagaatatccaacatttctctatqtcatgcccat -1047 

. iiiiiTiiiiininMiii|.iiiiiiiiiiiiiiiii , 

995 ttacagtgctcggaagaagaatatccaacatttctctatgi 



• * • 

qtcqccaacaagacttttttttgaggaaacctgtttggcctcaaaaqatg 

Tl I T 1 1 M M IT M 1111111 JT ITT 1 1 1 1 1 J T j J |TT t M 1 1 1 1 1 T t T T 

gtcgccaacaagacttttttttgaggaaacctgtttggcctcaaaagatg 

• • • • • 

ccatgcctttcgatctactgaagagaaaactaatqtcacqattqaagact 

i iimi mm mm i mi i m li imiiiii miiiii 

ccatqccattcgatctactgaagagaaaactgatgtcacgattgaagact 

atgaag'aggaatggtcttatatt 

"ITTTITITTTTTIf 1 11111111111111)1111 mill 

;tqqqtatccaagttacaaaaqtttatqaaqaqqaatqqtcatatatt 



1148 ctgggtatccaagttacaaaaatttatgaagaqqaatqqtcttatattcc 1197 

IITTTIIlllllimillll 1 1] I JTI iTlTTi 1 ITT I M IIIII 11M 

1145 ctgggtatccaagttacaaaagtttatgaagaggaatggtcatatattcc 1194 

• • • • • 

1198 tgttqggggttctttaccaaacacagaqcaaaaqaacctaqcatttqqtq 1247 

mill mil 111 1 1 mini nil 1 1 mil 1 1111111111111 

1195 tgttggtggttctttaccaaacacagagcaaaagaacctagcatttggtg 1244 
1248. ctgcagcaagcatggtgcakcaqcaacaqqctattcqqttqtacqatca 1297 

. 1 IT 1 IT 1 1 IT 1 1 ITT J T 1 1 J 1 1 IT 1 1 1 1 ITT 11 1 J 1 1 TtTTT T I IT III 19QA 

1245 Gtgcagcaagcatggtgcatccagcaacaggctattcggttgtacggtca 1294 

1298 ctatcagaaqctccaaaatatgcttctataattqcaaaqattttqaaqca 1347 

I 1 1 III ITIIII I Ml) Ml 11 III III 1 1 1 1 T 1 1 1 1 IT 1 1 T 1 1 
1295 ctgtcagaagctccaaaatatgcttctgtaatt 1344 

1348 agataactctqcatatqtqqtttctqqacaaaqcaqtqcaqtaaac^ 1397 

IT I J 1 1 1 J I IT I lllTlTflll Ti l J TTil 11)1 II Mill 17Q , 

1345 agaUactctgcgtatgtggtUctggacaaagtagtgcagt 1394 

• • * * 

1 ^qr raataraaacataaaqcaqtctttagccaaaggagcgaaaacgtcaaaga 1447 
1398 caat <^*M <fl ^ ( , , { ^ , , ^ 

1395 caatgcaagcaUgagcagtcUtggccaaagga 1444 



gcal 



SUBSTITUTE SHEET (RULE 26) 



WO 99/63055 PCT/US99/12121 



36/45 



1498 aaccagaacgttctttagaaccttcttccgcttgccaacttggatgtggt 

1/1QC MIIIIIII J J" I J J J I T 1 1 1 1 J J 1 1 J 1 1 T I J JT 1 1 1 1 1 J I TT I J T J TT J 

1495 aaccaga-acattctttagaaccttcttccgcttgccaacttggatgtggt 

■• • • • • 

1548 ggggtttccttgggtcttcactatcatctttcgatcttgtattgttttcc 

i« K TnTTTJ I J 1 1 J JTTTI I J J f 1 1 J I J 1 1 J I J J J IT I J I J II 111111111 

1545 ggggtttccttgggtcttcactatcatctttcgatctcgtcttgttttcc 

• * • • • 

1598 atgtacatgtttgttttggccccgaacagcatgaggatgtcacttgtgag 

1CQC II U 1 1 11 TlJl nil! TT I H 1 1 1 i.TI I JTITTI JTJ 1 1 1 J-ITJTIT 

1595 atgtacatgtttgttttggcgccaaacagcatgaggatgtcacttgtgag 

• • • • • 

1648 acatttgctttcagatccttctggtgcagttatggttaaagcttacctcg 

nc , c 1 1 IXJ-J-T I J J J I ITI J I i J Jl ITTJ.TI ITJJ.1 JTTI I IT 1 1 1 1 1 1 J IT 

1645 acatttgctttcagatccttctggtgcagttatggtaagagcttacctcg 

• • • • • 

1698 aaaggtaatc . . . tgttttatgaaactatagtgtctcattaaataaatga 

ncoc miU 1 IK 1 1 IIUJ 111 ! I' 'ITI }1 1 1 1 in i in 

1695 aaaggtagtctcatctattattaaactctagtgtttcaccaaataaatg 
1745 ggatcct'tcgtatatgtatatgatcatctctatgtatatcctatattct 

„« TTIIIimT II ITIIII1TIIIIJIIIIIITIIIIIIIJ II II 

1745 ggatccttcgaatgtgtatatgatcatctctatgtatatcctgtactct 



1795 atctcataaagtaatcgaaaattcattga 

17QC ii mi ii 1 1 nil i J i j 

1795 atctcataaagtaaatgccgggtttgata 



agaaaaaaaaaaaaaaaaaa 

A* ♦J" 11 

tgttgtgtcaaaccggccaa 



1845 aaaa 

1845 tgatataaagtaaatttattgatacaaaagtagtttttttttttaaaaaa 



FIG. 23D 

FIG. 24A 

GAP program of Genetics Computer Group 
blosum62.cmp 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 
Quality: 2728 Length: 530 

Ratio: 5.147 Gaps: 0 

Percent Similarity: 99.623 Percent Identity: 99.057 
Match display thresholds for the alignment(s): 

| = IDENTITY : = 2 . = 1 



Adonis palaestina ε-cyclase #3 x Adonis palaestina ε-cyclase 

  1 MELLGVRNLISSCPVWTFGTRNLSSSKLAYNIHRYGSSCRVDFQVRADGG 50 
    |||||||||||||||||||||||||||||||||||||||||||||||||| 
  1 MELLGVRNLISSCPVWTFGTRNLSSSKLAYNIHRYGSSCRVDFQVRADGG 50 

 51 SGSRTSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100 
    ||||.||||||||||||||||||||||||||||||||||||||||||||| 
 51 SGSRSSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100 

101 IPFGESVMDLVVIGCGPAGLSLAAEAAKLGLKVGLIGPDLPFTNNYGVWE 150 
    |||||||||||||||||||||||||||||||||||||||||||||||||| 
101 IPFGESVMDLVVIGCGPAGLSLAAEAAKLGLKVGLIGPDLPFTNNYGVWE 150 

151 DEFKDLGLERCIEHAWKDTIVYLDNDAPVLIGRAYGRVSRHLLHEELLKR 200 
    |||||||||||||||||||||||||||||||||||||||||||||||||| 
151 DEFKDLGLERCIEHAWKDTIVYLDNDAPVLIGRAYGRVSRHLLHEELLKR 200 

201 CVESGVSYLNSKVERITEAGDGHSLVVCENDIFIPCRLATVASGAASGKL 250 
    |||||||||.||||||||||||||||||||:||||||||||||||||||| 
201 CVESGVSYLDSKVERITEAGDGHSLVVCENEIFIPCRLATVASGAASGKL 250 

251 LEYEVGGPRVCVQTAYGVEVEVENNPYDPNLMVFMDYRDYMQQKLQCSEE 300 
    |||||||||||||||||||||||||||||||||||||||||||||||||| 
251 LEYEVGGPRVCVQTAYGVEVEVENNPYDPNLMVFMDYRDYMQQKLQCSEE 300 

301 EYPTFLYVMPMSPTRLFFEETCLASKDAMPFDLLKRKLMSRLKTLGIQVT 350 
    |||||||||||||||||||||||||||||||||||||||||||||||||| 
301 EYPTFLYVMPMSPTRLFFEETCLASKDAMPFDLLKRKLMSRLKTLGIQVT 350 
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351 KIYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSLSEAPK 400 

MMMMM MINIM I MINIMI MINIMI | j 1 1 1 1 1 1 1 1 1 

351 mEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSLSEAPK 400 

401 YASVIAKILKQDNSAYVVSGQSSAVNISMQAWSSLWPKERKRQRAFFLFG 450 
401 YASvIIkIlKQDNSAYW 450 

451 LELIVQUDIEATRTFFRTFFRLPTVM^GFLGSSLSSFDLVLFSMYMFVL 500 
451 LELIVQLdI^TRTFFRT^ 500 

501 APNSMRMSLVRHLLSDPSGAVMVKAYLER* 530 
501 AMsMfyiSLVmLL^DPS 530 

FIG. 24B 

FIG. 28A 

GAP of: Arabidopsis epsilon cyclase to Lettuce epsilon cyclase 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 1837 Length: 534 

Ratio: 3.499 Gaps: 3 

Percent Similarity: 76.381 Percent Identity: 69.905 
Match display thresholds for the alignment(s): 
| = IDENTITY : = 2 . = 1 



Arabidopsis x Lettuce 

• * 

1 MECVGARNF.AAMAVSTFPSW. . .SCRRKFPVVKRYSYRNIRFGLCSVRA 46 

III Nil . I MM I • • I' || ..|. : |: ' . 
1 MECFGARNMTATMAVFTCPRFTDCNIRHKFSLLKQRRFTNLSASSSLRQI 50 

47 SGGGSSGSESCVAVREDFADEEDFVKAGGSEILFVQMQQNFXIMDEQSKLV 96 

I I II I h 1 1 1 1 1 1 h I II II • I |: MM 

51 KCSAKSDRCVVDKQGISVADEEDYVKAGGSELFFVQMQRTKSMESQSKLS 100 
97 DKLPPISIGDGALDHVVIGCGPAGLALAAESAKLGLKVGLIGPDLPFTNN 146 

•II I II. II IIIIIIIIIIIIIIIMIIII IMMIIIMIM 

101 EKLAQIP I GNC I LD L VV I GCG PAG LALAAESAKLG VG L I GPD LP FTNN 150 

147 YGVWEDEFNDLGLQKCIEHVWRETIVYLDDDKPITIGRAYGRVSRRLLHE 196 

1 1 1 1 : 1 i I Ml: MM |::|:||||| II MIMIII I MM 
151 YGVWQDEFIGLGLEGCIEHSWKDTLVYLDDADPIRIGRAYGRVH^LLHE 200 

197 ELLRRCVESGVSYLSSKVDSITEASDGLRLVACDDNNAIPCRLATVASGA 246 

II MM II II MM 1 1 II-- MM -I |: |: I IMMIMMM 

201 ELLRRCVESGVSYLSSKVERITEAPNGYSLI ECEGNITI PCRLATVASGA 250 
247 ASGKLLQYEVGGPRVCVQTAYGVEVEVENSPYDPDQMVFMDYRDYTNEKV 296 

MM hll. MIMIIIMIhlMMI. Mill llllllll:. I 

251 ASGKFLEYELGGPRVCVQTAYGIEVEVENNPYDPDLMVFMDYRDFSKHKP 300 

297 RSLEAEYPTFLYAMPMTKSRLFFEETCtJ\SKDVMPFDLLlb'KLMLRLDTL 346 

MM- MUM I |. .::|||||||||:: |||.|||.||| II : 
301 ESLEAKYPTFLYV1W1SPTKIFFEETCLASREAMPFNLLKSKLMSRLKAM 350 
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FIG. 28B 



347 GI RI LKTYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSL 396 

nil r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 

351 GI RITRTYEEEWSYI PVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSL 400 

397 SEAPKYASVIAEI LREETTKQINS N I SRQAWDTLWPPERKRQRAF 441 

MM ||. III. Ill:: -I:- I IIIMMMIM llllllll 
401 SEAPNYAAVIAKILRQDQSKEMISLGKYTNISKQAWETLWPLERKRQRAF 450 

442 FLFGU\LIVQFDTEGIRSFFRTFFRLPKWWQGFLGSTLTSGDLV 491 

Mill. II I II MIMMMIMM 11111- U IhMIII 

451 FLFGl^HIVUCDLEGTRTFFRTFFRLPKWMWWGFLGSSLSSTDLIIFALY 500 

492 MFVISPNNLRKGLINHLISDPTGATMIKTYLKV* 525 

lll|.|..|| I: IhlMllllhl II -'I 
501 MFVIAPHSLRMELVRHLLSDPTGATMVKAYLTI* 534 
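
The summary values printed with the GAP comparisons in FIGS. 24A and 28A can be cross-checked against the alignments with a short calculation. The following Python sketch is illustrative only and is not part of the GCG software. It assumes that Ratio is Quality divided by the length of the shorter sequence (530 symbols in FIG. 24A; 525 symbols, including the terminator, for the Arabidopsis sequence in FIG. 28), that Percent Identity counts only aligned columns without a gap, and that gaps are written as "."; the function names `percent_identity` and `gap_ratio` are hypothetical.

```python
def percent_identity(top: str, bottom: str, gap: str = ".") -> float:
    """Percentage of aligned, non-gap columns whose symbols are identical."""
    if len(top) != len(bottom):
        raise ValueError("aligned rows must have equal length")
    # Keep only columns in which neither row carries the (assumed) gap symbol.
    pairs = [(a, b) for a, b in zip(top, bottom) if a != gap and b != gap]
    if not pairs:
        raise ValueError("no aligned columns to compare")
    identical = sum(1 for a, b in pairs if a == b)
    return 100.0 * identical / len(pairs)


def gap_ratio(quality: float, shorter_length: int) -> float:
    """Quality divided by the length of the shorter sequence (assumed definition)."""
    return quality / shorter_length


# Residues 201-250 of the two Adonis palaestina sequences shown in FIG. 24A.
CLONE_3_ROW = "CVESGVSYLNSKVERITEAGDGHSLVVCENDIFIPCRLATVASGAASGKL"
CLONE_5_ROW = "CVESGVSYLDSKVERITEAGDGHSLVVCENEIFIPCRLATVASGAASGKL"

print(round(percent_identity(CLONE_3_ROW, CLONE_5_ROW), 1))  # 96.0 (48 of 50 columns)
print(round(gap_ratio(2728, 530), 3))  # 5.147, the Ratio reported in FIG. 24A
print(round(gap_ratio(1837, 525), 3))  # 3.499, the Ratio reported in FIG. 28A
```

Under these assumptions the reported ratios are reproduced from the reported Quality values, and the two substitutions visible in residues 201-250 account for the 96.0% identity of that row.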






SEQUENCE LISTING 



<110> CUNNINGHAM JR., FRANCIS X. 
SUN, ZAIREN 

<120> GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM AND 
METHODS OF USE THEREOF 

<130> 8172-9023 

<140> NOT YET ASSIGNED 
<141> 1999-06-02 

<150> 09/088,724 
<151> 1998-06-02 

<150> 09/088,725 

<151> 1998-06-02 

<160> 61 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 1860 

<212> DNA 
<213> Arabidopsis thaliana 

<220> 
<221> CDS 

<222> (109)..(1680) 
<400> 1 

ACAAAAGGAA ATAATTAGAT TCCTCTTTCT GCTTGCTATA CCTTGATAGA ACAATATAAC 60 

AATGGTGTAA GTCTTCTCGC TGTATTCGAA ATTATTTGGA GGAGGAAA ATG GAG TGT 117 

Met Glu Cys 
1 

GTT GGG GCT AGG AAT TTC GCA GCA ATG GCG GTT TCA ACA TTT CCG TCA 165 
Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr Phe Pro Ser 
5 10 15 

TGG AGT TGT CGA AGG AAA TTT CCA GTG GTT AAG AGA TAC AGC TAT AGG 213 
Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr Ser Tyr Arg 
20 25 30 35 

AAT ATT CGT TTC GGT TTG TGT AGT GTC AGA GCT AGC GGC GGC GGA AGT 261 
Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly Gly Gly Ser 
40 45 50 

TCC GGT AGT GAG AGT TGT GTA GCG GTG AGA GAA GAT TTC GCT GAC GAA 309 
Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe Ala Asp Glu 
55 60 65 

GAA GAT TTT GTG AAA GCT GGT GGT TCT GAG ATT CTA TTT GTT CAA ATG 357 
Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe Val Gln Met 
70 75 80 

CAG CAG AAC AAA GAT ATG GAT GAA CAG TCT AAG CTT GTT GAT AAG TTG 405 
Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val Asp Lys Leu 
85 90 95 

CCT CCT ATA TCA ATT GGT GAT GGT GCT TTG GAT CAT GTG GTT ATT GGT 453 
Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val Val Ile Gly 
100 105 110 115 

TGT GGT CCT GCT GGT TTA GCC TTG GCT GCA GAA TCA GCT AAG CTT GGA 501 
Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly 
120 125 130 

TTA AAA GTT GGA CTC ATT GGT CCA GAT CTT CCT TTT ACT AAC AAT TAC 549 
Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr 
135 140 145 

GGT GTT TGG GAA GAT GAA TTC AAT GAT CTT GGG CTG CAA AAA TGT ATT 597 
Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln Lys Cys Ile 
150 155 160 

GAG CAT GTT TGG AGA GAG ACT ATT GTG TAT CTG GAT GAT GAC AAG CCT 645 
Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp Asp Lys Pro 
165 170 175 

ATT ACC ATT GGC CGT GCT TAT GGA AGA GTT AGT CGA CGT TTG CTC CAT 693 
Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His 
180 185 190 195 

GAG GAG CTT TTG AGG AGG TGT GTC GAG TCA GGT GTC TCG TAC CTT AGC 741 
Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser 
200 205 210 

TCG AAA GTT GAC AGC ATA ACA GAA GCT TCT GAT GGC CTT AGA CTT GTT 789 
Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu Arg Leu Val 
215 220 225 

GCT TGT GAC GAC AAT AAC GTC ATT CCC TGC AGG CTT GCC ACT GTT GCT 837 
Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala Thr Val Ala 
230 235 240 

TCT GGA GCA GCT TCG GGA AAG CTC TTG CAA TAC GAA GTT GGT GGA CCT 885 
Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val Gly Gly Pro 
245 250 255 

AGA GTC TGT GTG CAA ACT GCA TAC GGC GTG GAG GTT GAG GTG GAA AAT 933 
Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn 
260 265 270 275 

AGT CCA TAT GAT CCA GAT CAA ATG GTT TTC ATG GAT TAC AGA GAT TAT 981 
Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr Arg Asp Tyr 
280 285 290 

ACT AAC GAG AAA GTT CGG AGC TTA GAA GCT GAG TAT CCA ACG TTT CTG 1029 
Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu 
295 300 305 

TAC GCC ATG CCT ATG ACA AAG TCA AGA CTC TTC TTC GAG GAG ACA TGT 1077 
Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys 
310 315 320 

TTG GCC TCA AAA GAT GTC ATG CCC TTT GAT TTG CTA AAA ACG AAG CTC 1125 
Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu 
325 330 335 

ATG TTA AGA TTA GAT ACA CTC GGA ATT CGA ATT CTA AAG ACT TAC GAA 1173 
Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys Thr Tyr Glu 
340 345 350 355 

GAG GAG TGG TCC TAT ATC CCA GTT GGT GGT TCC TTG CCA AAC ACC GAA 1221 
Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu 
360 365 370 

CAA AAG AAT CTC GCC TTT GGT GCT GCC GCT AGC ATG GTA CAT CCC GCA 1269 
Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala 
375 380 385 

ACA GGC TAT TCA GTT GTG AGA TCT TTG TCT GAA GCT CCA AAA TAT GCA 1317 
Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala 
390 395 400 

TCA GTC ATC GCA GAG ATA CTA AGA GAA GAG ACT ACC AAA CAG ATC AAC 1365 
Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys Gln Ile Asn 
405 410 415 

AGT AAT ATT TCA AGA CAA GCT TGG GAT ACT TTA TGG CCA CCA GAA AGG 1413 
Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg 
420 425 430 435 

AAA AGA CAG AGA GCA TTC TTT CTC TTT GGT CTT GCA CTC ATA GTT CAA 1461 
Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu Ile Val Gln 
440 445 450 

TTC GAT ACC GAA GGC ATT AGA AGC TTC TTC CGT ACT TTC TTC CGC CTT 1509 
Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu 
455 460 465 

CCA AAA TGG ATG TGG CAA GGG TTT CTA GGA TCA ACA TTA ACA TCA GGA 1557 
Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly 
470 475 480 

GAT CTC GTT CTC TTT GCT TTA TAC ATG TTC GTC ATT TCA CCA AAC AAT 1605 
Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser Pro Asn Asn 
485 490 495 

TTG AGA AAA GGT CTC ATC AAT CAT CTC ATC TCT GAT CCA ACC GGA GCA 1653 
Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro Thr Gly Ala 
500 505 510 515 

ACC ATG ATA AAA ACC TAT CTC AAA GTA TGATTTACTT ATCAACTCTT 1700 
Thr Met Ile Lys Thr Tyr Leu Lys Val 
520 

AGGTTTGTGT ATATATATGT TGATTTATCT GAATAATCGA TCAAAGAATG GTATGTGGGT 1760 

TACTAGGAAG TTGGAAACAA ACATGTATAG AATCTAAGGA GTGATCGAAA TGGAGATGGA 1820 

AACGAAAAGA AAAAAATCAG TCTTTGTTTT GTGGTTAGTG 1860 



<210> 2 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 



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<400> 2 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr 

1 5 10 15 

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr 
20 25 30 

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly 
35 40 45 

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe 
50 55 60 

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe 
65 70 75 80 

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val 

85 90 95 

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val 
100 105 110 

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala 
115 120 125 

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr 
130 135 140 

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln 
145 150 155 160 

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp 
165 170 175 

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg 
180 185 190 

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser 
195 200 205 

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu 
210 215 220 

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala 
225 230 235 240 

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val 
245 250 255 

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu 
260 265 270 

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr 
275 280 285 

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro 
290 295 300 

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu 
305 310 315 320 



Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 
325 330 335 

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys 
340 345 350 

Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro 
355 360 365 

Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 
370 375 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 400 

Lys Tyr Ala Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys 
405 410 415 

Gln Ile Asn Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro 
420 425 430 

Pro Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu 
435 440 445 

Ile Val Gln Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe 
450 455 460 

Phe Arg Leu Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu 
465 470 475 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser 
485 490 495 

Pro Asn Asn Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro 
500 505 510 

Thr Gly Ala Thr Met Ile Lys Thr Tyr Leu Lys Val 
515 520 
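
The relationship between the CDS feature of SEQ ID NO: 1, located at (109)..(1680), and the 524-residue protein of SEQ ID NO: 2 can be checked with a brief calculation. The following Python sketch is illustrative only: it assumes the standard genetic code and 1-based, inclusive feature coordinates, and the helper names `cds_codon_count` and `translate` are hypothetical. `SEQ1_PREFIX` holds only the first 117 bases of SEQ ID NO: 1 as listed above, not the full sequence.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_codon_count(start: int, end: int) -> int:
    """Number of codons in a CDS given 1-based, inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


def translate(sequence: str, start: int, end: int) -> str:
    """Translate sequence[start..end] (1-based, inclusive) to one-letter residues."""
    cds = sequence[start - 1:end].upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 117 bases of SEQ ID NO: 1 (two listing rows), ending with the first three codons.
SEQ1_PREFIX = (
    "ACAAAAGGAAATAATTAGATTCCTCTTTCTGCTTGCTATACCTTGATAGAACAATATAAC"
    "AATGGTGTAAGTCTTCTCGCTGTATTCGAAATTATTTGGAGGAGGAAAATGGAGTGT"
)

print(cds_codon_count(109, 1680))        # 524 codons, matching <211> 524 of SEQ ID NO: 2
print(translate(SEQ1_PREFIX, 109, 117))  # MEC, i.e. Met Glu Cys
```

The codon count agrees with the length given for SEQ ID NO: 2, and the first three codons translate to Met Glu Cys, the first three residues of that sequence.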

<210> 3 

<211> 956 

<212> DNA 

<213> Arabidopsis thaliana 
<400> 3 

GCTCTTTCTC CTCCTCCTCT ACCGATTTCC GACTCCGCCT CCCGAAATCC TTATCCXSGAT 60 

TCTCTCCGTC TCTTCGATTT AAACGCTTTT CTGTCTGTTA CGTCGTCGAA GAAOGGAGAC 120 

AGAATTCTCC GATTGAGAAC GATGAGAGAC CGGAGAGCAC GAGCTCCACA AACGCTATAG 180 

ACGCTGAGTA TCTGGCGTTG CGTTTGGCGG AGAAATTGGA GAGGAAGAAA TCGGAGAGGT 240 

CCACTTATCT AATCGCTGCT ATGTTGTCGA GCTTTGGTAT CACTTCTATG GCTGTTATGG 300 

CTGTTTACTA CAGATTCTCT TGGCAAATGG AGGGAGGTGA GATCTCAATG TTGGAAATGT 360 

TTGGTACATT TGCTCTCTCT GTTGGTGCTG CTGTTGGTAT GGAATTCTGG GCAAGATGGG 420 

CTCATAGAGC TCTGTGGCAC GCTTCTCTAT GGAATATGCA TGAGTCACAT CACAAACCAA 480 

GAGAAGGACC GTTTGAGCTA AACGATGTTT TTGCTATAGT GAACGCTGGT CCAGCGATTG 540 

GTCTCCTCTC TTATGGATTC TTCAATAAAG GACTCGTTCC TGGTCTCTGC TTTGGCGCCG 600 

GGTTAGGCAT AACGGTGTTT GGAATCGCCT ACATGTTTGT CCACGATGGT CTCGTGCACA 660 

AGCGTTTCCC TGTAGGTCCC ATCGCCGACG TCCCTTACCT CCGAAAGGTC GCCGCCGCTC 720 

ACCAGCTACA TCACACAGAC AAGTTCAATG GTGTACCATA TGGACTGTTT [illegible] 780 

AGGAATTGGA AGAAGTTGGA GGAAATGAAG AGTTAGATAA GGAGATTAGT CGGAGAATCA 840 

AATCATACAA AAAGGCCTCG GGCTCCGGGT CGAGTTCGAG TTCTTGACTT TAAACAAGTT 900 

TTAAATCCCA AATTCTTTTT TTGTCTTCTG TCATTATGAT CATCTTAAGA CGGTCT 956 



<210> 4 
<211> 294 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 4 

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser 
1 5 10 15 

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
20 25 30 

Tyr Val Val Glu Glu Arg Arg Gln Asn Ser Pro Ile Glu Asn Asp Glu 
35 40 45 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala Ile Asp Ala Glu Tyr Leu 
50 55 60 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
65 70 75 80 

Thr Tyr Leu Ile Ala Ala Met Leu Ser Ser Phe Gly Ile Thr Ser Met 
85 90 95 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gln Met Glu Gly Gly 
100 105 110 

Glu Ile Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
115 120 125 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
130 135 140 

Trp His Ala Ser Leu Trp Met Asn His Glu Ser His His Lys Pro Arg 
145 150 155 160 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala Ile Val Asn Ala Gly 
165 170 175 

Pro Ala Ile Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
180 185 190 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly Ile Thr Val Phe Gly Ile 
195 200 205 

Ala Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val 
210 215 220 

Gly Pro Ile Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
225 230 235 240 

Gln Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
245 250 255 

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
260 265 270 

Lys Glu Ile Ser Arg Arg Ile Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
275 280 285 

Gly Ser Ser Ser Ser Ser 
290 

<210> 5 

<211> 162 
<212> PRT 

<213> Alcaligenes sp. 

<400> 5 
Met Thr Gln Phe Leu Ile Val Val Ala Thr Val Leu Val Met Glu Leu 
1 5 10 15 

Thr Ala Tyr Ser Val His Arg Trp Ile Met His Gly Pro Leu Gly Trp 
20 25 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
35 40 45 

Asn Asp Leu Tyr Gly Val Val Phe Ala Val Leu Ala Thr Ile Leu Phe 
50 55 60 

Thr Val Gly Ala Tyr Trp Trp Pro Val Leu Trp Trp Ile Ala Leu Gly 
65 70 75 80 

Met Thr Val Tyr Gly Leu Ile Tyr Phe Ile Leu His Asp Gly Leu Val 
85 90 95 

His Gln Arg Trp Pro Phe Arg Tyr Ile Pro Arg Arg Gly Tyr Phe Arg 
100 105 110 

Arg Leu Tyr Gln Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 120 125 

His Cys Val Ser Phe Gly Phe Ile Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gln Asp Leu Lys Arg Ser Gly Val Leu Arg Pro Gln Asp Glu Arg 
145 150 155 160 

Pro Ser 



<210> 6 
<211> 175 
<212> PRT 

<213> Erwinia herbicola 
<400> 6 

Met Leu Asn Ser Leu Ile Val Ile Leu Ser Val Ile Ala Met Glu Gly 
1 5 10 15 

Ile Ala Ala Phe Thr His Arg Tyr Ile Met His Gly Trp Gly Trp Arg 
20 25 30 

Trp His Glu Ser His His Thr Pro Arg Lys Gly Val Phe Glu Leu Asn 
35 40 45 

Asp Leu Phe Ala Val Val Phe Ala Gly Val Ala Ile Ala Leu Ile Ala 
50 55 60 

Val Gly Thr Ala Gly Val Trp Pro Leu Gln Trp Ile Gly Cys Gly Met 
65 70 75 80 

Thr Val Tyr Gly Leu Leu Tyr Phe Leu Val His Asp Gly Leu Val His 
85 90 95 

Gln Arg Trp Pro Phe His Trp Ile Pro Arg Arg Gly Tyr Leu Lys Arg 
100 105 110 

Leu Tyr Val Ala His Arg Leu His His Ala Val Arg Gly Arg Glu Gly 
115 120 125 

Cys Val Ser Phe Gly Phe Ile Tyr Ala Arg Lys Pro Ala Asp Leu Gln 
130 135 140 

Ala Ile Leu Arg Glu Arg His Gly Arg Pro Pro Lys Arg Asp Ala Ala 
145 150 155 160 

Lys Asp Arg Pro Asp Ala Ala Ser Pro Ser Ser Ser Ser Pro Glu 
165 170 175 

<210> 7 
<211> 175 
<212> PRT 

<213> Erwinia uredovora 
<400> 7 

Met Leu Trp Ile Trp Asn Ala Leu Ile Val Phe Val Thr Val Ile Gly
1 5 10 15

Met Glu Val Ile Ala Ala Leu Ala His Lys Tyr Ile Met His Gly Trp
20 25 30

Gly Trp Gly Trp His Leu Ser His His Glu Pro Arg Lys Gly Ala Phe
35 40 45

Glu Val Asn Asp Leu Tyr Ala Val Val Phe Ala Ala Leu Ser Ile Leu
50 55 60

Leu Ile Tyr Leu Gly Ser Thr Gly Met Trp Pro Leu Gln Trp Ile Gly
65 70 75 80

Ala Gly Met Thr Ala Tyr Gly Leu Leu Tyr Phe Met Val His Asp Gly
85 90 95

Leu Val His Gln Arg Trp Pro Phe Arg Tyr Ile Pro Arg Lys Gly Tyr
100 105 110

Leu Lys Arg Leu Tyr Met Ala His Arg Met His His Ala Val Arg Gly
115 120 125

Lys Glu Gly Cys Val Ser Phe Gly Phe Leu Tyr Ala Pro Pro Leu Ser
130 135 140

Lys Leu Gln Ala Thr Leu Arg Glu Arg His Gly Ala Arg Ala Gly Ala
145 150 155 160

Ala Arg Asp Ala Gln Gly Gly Glu Asp Glu Pro Ala Ser Gly Lys
165 170 175

<210> 8 
<211> 162 
<212> PRT 

<213> Agrobacterium aurantiacum
<400> 8 

Met Thr Asn Phe Leu Ile Val Val Ala Thr Val Leu Val Met Glu Leu
1 5 10 15

Thr Ala Tyr Ser Val His Arg Trp Ile Met His Gly Pro Leu Gly Trp
20 25 30

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys
35 40 45

Asn Asp Leu Tyr Gly Leu Val Phe Ala Val Ile Ala Thr Val Leu Phe
50 55 60

Thr Val Gly Trp Ile Trp Ala Pro Val Leu Trp Trp Ile Ala Leu Gly
65 70 75 80

Met Thr Val Tyr Gly Leu Ile Tyr Phe Val Leu His Asp Gly Leu Val
85 90 95

His Trp Arg Trp Pro Phe Arg Tyr Ile Pro Arg Lys Gly Tyr Ala Arg
100 105 110

Arg Leu Tyr Gln Ala His Arg Leu His His Ala Val Glu Gly Arg Asp
115 120 125

His Cys Val Ser Phe Gly Phe Ile Tyr Ala Pro Pro Val Asp Lys Leu
130 135 140

Lys Gln Asp Leu Lys Met Ser Gly Val Leu Arg Ala Glu Ala Gln Glu
145 150 155 160

Arg Thr



<210> 9 
<211> 954 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 9 

CCACGGGTCC GCCTCCCCGT TTTTTTCCGA TCCGATCTCC GGTGCCGAGG ACTCAGCTGT 60 
TTGTTCGCGC TTTCTCAGCC GTCACCATGA CCGATTCTAA CGATGCTGGA ATGGATGCTG 120 
TTPAf^AGAPG ACTCATGTTT GAAGACGAAT GCATTCTCGT TGATGAAAAT AATCGTGTGG 180
1 ouon^n 1 \jr\ CACTAAGTAT AACTGTCATC TGATGGAAAA GATTGAAGCT GAGAATTTAC 240
TTTPAGTGTG III V* .r-iO lulu TTTTTATTCA ACTCCAAGTA TGAGTTGCTT CTCCAGCAAC 300
AAAGHTTACT TTCCCACTTG TGTGGACAAA CACTTGTTGC AGCCATCCTC 360
TTTACf^TGA ATCCGAGCTT ATTGAAGAGA ATGTGCTTGG TGTAAGAAAT GCCGCACAAA 420
GGAAGCTTTT CGATGAGCTC GGTATTGTAG CAGAAGATGT ACCAGTCGAT GAGTTCACTC 480
CCTTGGGACG CATGCTTTAC AAGGCACCTT CTGATGGGAA ATGGGGAGAG CACGAAGTTG 540
ACTATCTACT CTTCATCGTG CGGGATGTGA AGCTTCAACC AAACCCAGAT GAAGTGGCTG 600
APATCAAGTA CGTGAGCAGG GAAGAGCTTA AGGAGCTGGT GAAGAAAGCA GATGCTGGCG 660
ATGAAGCTGT GAAACTATCT CCATGGTTCA GATTGGTGGT GGATAATTTC TTGATGAAGT 720
GGTGGGATCA TGTTGAGAAA GGAACTATCA CTGAAGCTGC AGACATGAAA ACCATTCACA 780
AGCTCTGAAC TTTCCATAAG TTTTGGATCT TCCCCTTCCC ATAATAAAAT TAAGAGATGA 840
GACTTTTATT GATTACAGAC AAAACTGGCA ACAAAATCTA TTCCTAGGAT TTTTTTTTGC 900
TTTTTATTTA CTTTTGATTC ATCTCTAGTT TAGTTTTCAT CTTAAAAAAA AAAA 954



<210> 10 
<211> 996 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 10 

CACCAATGTC TGTTTCTTCT TTATTTAATC TCCCATTGAT TCGCCTCAGA TCTCTCGCTC 60

TTTCGTCTTC TTTTTCTTCT TTCCGATTTG CCCATCGTCC TCTGTCATCG ATTTCACCGA 120

GAAAGTTACC GAATTTTCGT GCTTTCTCTG GTACCGCTAT GACAGATACT AAAGATGCTG 180

GTATGGATGC TGTTCAGAGA CGTCTCATGT TTGAGGATGA ATGCATTCTT GTTGATGAAA 240

CTGATCGTGT TGTGGGGCAT GTCAGCAAGT ATAATTGTCA TCTGATGGAA AATATTGAAG 300

CCAAGAATTT GCTGCACAGG GCTTTTAGTG TATTTTTATT CAACTCGAAG TATGAGTTGC 360

TTCTCCAGCA AAGGTCAAAC ACAAAGGTTA CGTTCCCTCT AGTGTGGACT AACACTTGTT 420

GCAGCCATCC TCTTTACCGT GAATCAGAGC TTATCCAGGA CAATGCACTA GGTGTGAGGA 480

ATGCTGCACA AAGAAAGCTT CTCGATGAGC TTGGTATTGT AGCTGAAGAT GTACCAGTCG 540

ATGAGTTCAC TCCCTTGGGA CGTATGCTGT ACAAGGCTCC TTCTGATGGC AAATGGGGAG 600

AGCATGAACT TGATTACTTG CTCTTCATCG TGCGAGACGT GAAGGTTCAA CCAAACCCAG 660

ATGAAGTAGC TGAGATCAAG TATGTGAGCC GGGAAGAGCT GAAGGAGCTG GTGAAGAAAG 720

CAGATGCAGG TGAGGAAGGT TTGAAACTGT CACCATGGTT CAGATTGGTG GTGGACAATT 780

TCTTGATGAA GTGGTGGGAT CATGTTGAGA AAGGAACTTT GGTTGAAGCT ATAGACATGA 840

AAACCATCCA CAAACTCTGA ACATCTTTTT TTAAAGTTTT TAAATCAATC AACTTTCTCT 900

TCATCATTTT TATCTTTTCG ATGATAATAA TTTGGGATAT GTGAGACACT TACAAAACTT 960

CCAAGCACCT CAGGCAATAA TAAAGTTTGC GGCCGC 996

<210> 11 
<211> 1165 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 11 

CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC GATGCCGCGA 60

TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCCCG CGTGAACTCC GCCCAGCAGC 120

CCAGCTGTGC ACACGCGCGA CTCCAGTTTA AGCTCAGGAG CATGCAGATG ACGCTCATGC 180

AGCCCAGCAT CTCAGCCAAT CTGTCGCGCG CCGAGGACCG CACAGACCAC ATGAGGGGTG 240

CAAGCACCTG GGCAGGCGGG CAGTCGCAGG ATGAGCTGAT GCTGAAGGAC GAGTGCATCT 300

TGGTGGATGT TGAGGACAAC ATCACAGGCC ATGCCAGCAA GCTGGAGTGT CACAAGTTCC 360

TACCACATCA GCCTGCAGGC CTGCTGCACC GGGCCTTCTC TGTGTTCCTG TTTGACGATC 420

AGGGGCGACT GCTGCTGCAA CAGCGTGCAC GCTCAAAAAT CACCTTCCCA AGTGTGTGGA 480

CGAACACCTG CTGCAGCCAC CCTTTACATG GGCAGACCCC AGATGAGGTG GACCAACTAA 540

GCCAGGTGGC CGACGGAACA GTACCTGGCG CAAAGGCTGC TGCCATCCGC AAGTTGGAGC 600

ACGAGCTGGG GATACCAGCG CACCAGCTGC CGGCAAGCGC GTTTCGCTTC CTCACGCGTT 660

TGCACTACTG TGCCGCGGAC GTGCAGCCAG CTGCGACACA ATCAGCGCTC TGGGGCGAGC 720

ACGAAATGGA CTACATCTTG TTCATCCGGG CCAACGTCAC CTTGGCGCCC AACCCTGACG 780

AGGTGGACGA AGTCAGGTAC GTGACGCAAG AGGAGCTGCG GCAGATGATG CAGCCGGACA 840

ACGGGCTGCA ATGGTCGCCG TGGTTTCGCA TCATCGCCGC GCGCTTCCTT GAGCGTTGGT 900

GGGCTGACCT GGACGCGGCC CTAAACACTG ACAAACACGA GGATTGGGGA ACGGTGCATC 960

ACATCAACGA AGCGTGAAAG CAGAAGCTGC AGGATGTGAA GACACGTCAT GGGGTGGAAT 1020

TGCGTACTTG GCAGCTTCGT ATCTCCTTTT TCTGAGACTG AACCTGCAGT CAGGTCCCAC 1080

AAGGTCAGGT AAAATGGCTC GATAAAATGT ACCGTCACTT TTTGTCGCGT ATACTGAACT 1140

CCAAGAGGTC AAAAAAAAAA AAAAA 1165

<210> 12 
<211> 1135 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 12

CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC GATGCCGCGA 60
TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCGCG CGTGAACTCC GCCCAGCAGC 120
CCAGCTGTGC ACACGCGCGA CTCCAGTTTA AGCTCAGGAG CATGCAGCTG CTTTCCGAGG 180
ACCGCACAGA CCACATGAGG GGTGCAAGCA CCTGGGCAGG CGGGCAGTCG CAGGATGAGC 240
TGATGCTGAA GGACGAGTGC ATCTTGGTAG ATGTTGAGGA CAACATCACA GGCCATGCPA 300
GCAAGCTGGA GTGTCACAAG TTCCTACCAC ATCAGCCTGC AGGCCTGCTG CACCGGGCCT 360
TCTCTG^GTT CCTGTTTGAC GATCAGGGGC GACTGCTGCT GCAACAGCGT GCACGCTCAA 420
AAATCACCTT CCCAAGTGTG TGGACGAACA CCTGCTGCAG CCACCCTTTA CATGGGCAGA 480
CCCCAGATGA GGTGGACCAA CTAAGCCAGG TGGCCGACGG AACAGTACCT GGCGCAAAGG 540
CTGCTGCCAT CCGCAAGTTG GAGCACGAGC TGGGGATACC AGCGCACCAG CTGCCGGCAA 600
GCGCGTTTCG CTTCCTCACG CGTTTGCACT ACTGTGCCGC GGACGTGCAG CCAGCTGCGA 660
CACAATCAGC GCTCTGGGGC GAGCACGAAA TGGACTACAT CTTGTTCATC CGGGCCAACG 720
TCACCTTGGC GCCCAACCCT GACGAGGTGG ACGAAGTCAG GTACGTGACG CAAGAGGAGC 780
TGCGGCAGAT GATGCAGCCG GACAACGGGC TTCAATGGTC GCCGTGGTTT CGCATCATCG 840
CCGCGCGCTT CCTTGAGCGT TGGTGGGCTG ACCTGGACGC GGCCCTAAAC ACTGACAAAC 900
ACGAGGATTG GGGAACGGTG CATCACATCA ACGAAGCGTG AAGGCAGAAG CTGCAGGATG 960
TGAAGACACG TCATGGGGTG GAATTGCGTA CTTGGCAGCT TCGTATCTCC TTTTTCTGAG 1020
ACTGAACCTG CAGAGCTAGA GTCAATGGTG CATCATATTC ATCGTCTCTC TTTTGTTTTA 1080
GACTAATCTG TAGCTAGAGT CACTGATGAA TCCTTTACAA CTTTCAAAAA AAAAA 1135


<210> 13
<211> 960
<212> DNA

<213> Tagetes erecta
<400> 13

CCAAAAACAA CTCAAATCTC CTCCGTCGCT CTTACTCCGC CATGGGTGAC GACTCCGGCA 60
TGGATGCTGT TCAGCGACGT CTCATGTTTG ACGATGAATG CATTTTGGTG GATGAGTGTG 120
ACAATGTGGT GGGACATGAT ACCAAATACA ATTGTCACTT GATGGAGAAG ATTGAAACAG 180
GTAAAATGCT GCACAGAGCA TTCAGCGTTT TTCTATTCAA TTCAAAATAC GAGTTACTTC 240
TTCAGCAACG GTCTGCAACC AAGGTGACAT TTCCTTTAGT ATGGACCAAC ACCTGTTGCA 300
GCCATCCACT CTACAGAGAA TCCGAGCTTG TTCCCGAAAC GCCTGAGAGA ATGCTGCACA 360
GAGGANNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 660
NNNNNNNNNN NNNNNNNNNN TCATGTGCAA AAGGGTACAC TCACTGAATG CAATTTGATA 720
TGAAAACCAT ACACAAGCTG ATATAGAAAC ACACCCTCAA CCGAAAAGCA AGCCTAATAA 780
TTCGGGTTGG GTCGGGTCTA CCATCAATTG TTTTTTTCTT TTAACAACTT TTAATCTCTA 840
TTTGAGCATG TTGATTCTTG TCTTTTGTGT GTAAGATTTT GGGTTTCGTT TCAGTTGTAA 900
TAATGAACCA TTGATGGTTT GCAATTTCAA GTTCCTATCG ACATGTAGTG ATCTAAAAAA 960



<210> 14
<211> 305
<212> PRT

<213> Haematococcus pluvialis
<400> 14

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Met Thr Leu Met Gln Pro Ser Ile Ser Ala Asn Leu
35 40 45

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp
50 55 60

Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys Asp Glu Cys Ile
65 70 75 80

Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala Ser Lys Leu Glu
85 90 95

Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu Leu His Arg Ala
100 105 110

Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu Leu Leu Gln Gln
115 120 125

Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp Thr Asn Thr Cys
130 135 140

Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu Val Asp Gln Leu
145 150 155 160

Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala Ile
165 170 175

Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His Gln Leu Pro Ala
180 185 190

Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val
195 200 205

Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu His Glu Met Asp
210 215 220

Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp
225 230 235 240

Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu Leu Arg Gln Met
245 250 255

Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp Phe Arg Ile Ile
260 265 270

Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu
275 280 285

Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His His Ile Asn Glu
290 295 300

Ala
305

<210> 15
<211> 293
<212> PRT

<213> Haematococcus pluvialis
<400> 15

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Leu Leu Ser Glu Asp Arg Thr Asp His Met Arg Gly
35 40 45

Ala Ser Thr Trp Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys
50 55 60

Asp Glu Cys Ile Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala
65 70 75 80

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu
85 90 95

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu
100 105 110

Leu Leu Gln Gln Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp
115 120 125

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu
130 135 140

Val Asp Gln Leu Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys
145 150 155 160

Ala Ala Ala Ile Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His
165 170 175

Gln Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys
180 185 190

Ala Ala Asp Val Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu
195 200 205

His Glu Met Asp Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala
210 215 220

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu
225 230 235 240

Leu Arg Gln Met Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp
245 250 255

Phe Arg Ile Ile Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu
260 265 270

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His
275 280 285

His Ile Asn Glu Ala
290

<210> 16
<211> 284
<212> PRT

<213> Arabidopsis thaliana
<400> 16

Met Ser Val Ser Ser Leu Phe Asn Leu Pro Leu Ile Arg Leu Arg Ser
1 5 10 15

Leu Ala Leu Ser Ser Ser Phe Ser Ser Phe Arg Phe Ala His Arg Pro
20 25 30

Leu Ser Ser Ile Ser Pro Arg Lys Leu Pro Asn Phe Arg Ala Phe Ser
35 40 45

Gly Thr Ala Met Thr Asp Thr Lys Asp Ala Gly Met Asp Ala Val Gln
50 55 60

Arg Arg Leu Met Phe Glu Asp Glu Cys Ile Leu Val Asp Glu Thr Asp
65 70 75 80

Arg Val Val Gly His Val Ser Lys Tyr Asn Cys His Leu Met Glu Asn
85 90 95

Ile Glu Ala Lys Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe
100 105 110

Asn Ser Lys Tyr Glu Leu Leu Leu Gln Gln Arg Ser Asn Thr Lys Val
115 120 125

Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr
130 135 140

Arg Glu Ser Glu Leu Ile Gln Asp Asn Ala Leu Gly Val Arg Asn Ala
145 150 155 160

Ala Gln Arg Lys Leu Leu Asp Glu Leu Gly Ile Val Ala Glu Asp Val
165 170 175

Pro Val Asp Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro
180 185 190

Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Ile
195 200 205

Val Arg Asp Val Lys Val Gln Pro Asn Pro Asp Glu Val Ala Glu Ile
210 215 220

Lys Tyr Val Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp
225 230 235 240

Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val
245 250 255

Asp Asn Phe Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr Leu
260 265 270

Val Glu Ala Ile Asp Met Lys Thr Ile His Lys Leu
275 280

<210> 17
<211> 287
<212> PRT

<213> Clarkia breweri
<400> 17

Met Ser Ser Ser Met Leu Asn Phe Thr Ala Ser Arg Ile Val Ser Leu
1 5 10 15

Pro Leu Leu Ser Ser Pro Pro Ser Arg Val His Leu Pro Leu Cys Phe
20 25 30

Phe Ser Pro Ile Ser Leu Thr Gln Arg Phe Ser Ala Lys Leu Thr Phe
35 40 45

Ser Ser Gln Ala Thr Thr Met Gly Glu Val Val Asp Ala Gly Met Asp
50 55 60

Ala Val Gln Arg Arg Leu Met Phe Glu Asp Glu Cys Ile Leu Val Asp
65 70 75 80

Glu Asn Asp Lys Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu
85 90 95

Met Glu Lys Ile Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val
100 105 110

Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala
115 120 125

Thr Lys Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His
130 135 140

Pro Leu Tyr Arg Glu Ser Glu Leu Ile Asp Glu Asn Cys Leu Gly Val
145 150 155 160

Arg Asn Ala Ala Gln Arg Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala
165 170 175

Glu Asp Leu Pro Val Asp Gln Phe Ile Pro Leu Ser Arg Ile Leu Tyr
180 185 190

Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu
195 200 205

Leu Phe Ile Ile Arg Asp Val Asn Leu Asp Pro Asn Pro Asp Glu Val
210 215 220

Ala Glu Val Lys Tyr Met Asn Arg Asp Asp Leu Lys Glu Leu Leu Arg
225 230 235 240

Lys Ala Asp Ala Glu Glu Glu Gly Val Lys Leu Ser Pro Trp Phe Arg
245 250 255

Leu Val Val Asp Asn Phe Leu Phe Lys Trp Trp Asp His Val Glu Lys
260 265 270

Gly Ser Leu Lys Asp Ala Ala Asp Met Lys Thr Ile His Lys Leu
275 280 285

<210> 18
<211> 261
<212> PRT

<213> Arabidopsis thaliana
<400> 18

Thr Gly Pro Pro Pro Arg Phe Phe Pro Ile Arg Ser Pro Val Pro Arg
1 5 10 15

Thr Gln Leu Phe Val Arg Ala Phe Ser Ala Val Thr Met Thr Asp Ser
20 25 30

Asn Asp Ala Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe Glu Asp
35 40 45

Glu Cys Ile Leu Val Asp Glu Asn Asn Arg Val Val Gly His Asp Thr
50 55 60

Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala Glu Asn Leu Leu
65 70 75 80

His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu
85 90 95

Leu Gln Gln Arg Ser Lys Thr Lys Val Thr Phe Pro Leu Val Trp Thr
100 105 110

Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu Ile Glu
115 120 125

Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu Phe Asp
130 135 140

Glu Leu Gly Ile Val Ala Glu Asp Val Pro Val Asp Glu Phe Thr Pro
145 150 155 160

Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu
165 170 175

His Glu Val Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Lys Leu Gln
180 185 190

Pro Asn Pro Asp Glu Val Ala Glu Ile Lys Tyr Val Ser Arg Glu Glu
195 200 205

Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp Glu Ala Val Lys
210 215 220

Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Met Lys Trp
225 230 235 240

Trp Asp His Val Glu Lys Gly Thr Ile Thr Glu Ala Ala Asp Met Lys
245 250 255

Thr Ile His Lys Leu
260

<210> 19
<211> 288
<212> PRT

<213> Saccharomyces cerevisiae
<400> 19

Met Thr Ala Asp Asn Asn Ser Met Pro His Gly Ala Val Ser Ser Tyr
1 5 10 15

Ala Lys Leu Val Gln Asn Gln Thr Pro Glu Asp Ile Leu Glu Glu Phe
20 25 30

Pro Glu Ile Ile Pro Leu Gln Gln Arg Pro Asn Thr Arg Ser Ser Glu
35 40 45

Thr Ser Asn Asp Glu Ser Gly Glu Thr Cys Phe Ser Gly His Asp Glu
50 55 60

Glu Gln Ile Lys Leu Met Asn Glu Asn Cys Ile Val Leu Asp Trp Asp
65 70 75 80

Asp Asn Ala Ile Gly Ala Gly Thr Lys Lys Val Cys His Leu Met Glu
85 90 95

Asn Ile Glu Lys Gly Leu Leu His Arg Ala Phe Ser Val Phe Ile Phe
100 105 110

Asn Glu Gln Gly Glu Leu Leu Leu Gln Gln Arg Ala Thr Glu Lys Ile
115 120 125

Thr Phe Pro Asp Leu Trp Thr Asn Thr Cys Cys Ser His Pro Leu Cys
130 135 140

Ile Asp Asp Glu Leu Gly Leu Lys Gly Lys Leu Asp Asp Lys Ile Lys
145 150 155 160

Gly Ala Ile Thr Ala Ala Val Arg Lys Leu Asp His Glu Leu Gly Ile
165 170 175

Pro Glu Asp Glu Thr Lys Thr Arg Gly Lys Phe His Phe Leu Asn Arg
180 185 190

Ile His Tyr Met Ala Pro Ser Asn Glu Pro Trp Gly Glu His Glu Ile
195 200 205

Asp Tyr Ile Leu Phe Tyr Lys Ile Asn Ala Lys Glu Asn Leu Thr Val
210 215 220

Asn Pro Asn Val Asn Glu Val Arg Asp Phe Lys Trp Val Ser Pro Asn
225 230 235 240

Asp Leu Lys Thr Met Phe Ala Asp Pro Ser Tyr Lys Phe Thr Pro Trp
245 250 255

Phe Lys Ile Ile Cys Glu Asn Tyr Leu Phe Asn Trp Trp Glu Gln Leu
260 265 270

Asp Asp Leu Ser Glu Val Glu Asn Asp Arg Gln Ile His Arg Met Leu
275 280 285

<210> 20
<211> 456
<212> PRT

<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Consensus
sequence of four plant B-cyclases

<400> 20
Met Asp Thr Leu Leu Lys Thr Pro Asn Leu Glu Phe Leu Pro His Gly
1 5 10 15

Phe Val Lys Ser Phe Ser Lys Phe Gly Lys Cys Glu Gly Val Cys Val
20 25 30

Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys Glu Asn
35 40 45

Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val Val Asp
50 55 60

Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val Ala Gln Gln
65 70 75 80

Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp Pro Pro Lys Leu
85 90 95

Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met
100 105 110

Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly Ala Val Tyr Ile
115 120 125

Asp Asp Thr Lys Asp Leu Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln
130 135 140

Leu Lys Ser Lys Met Met Gln Lys Cys Ile Asn Gly Val Lys Phe His
145 150 155 160

Gln Ala Lys Val Ile Lys Val Ile His Glu Glu Lys Ser Met Leu Ile
165 170 175

Cys Asn Asp Gly Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly
180 185 190

Phe Ser Arg Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln
195 200 205

Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Lys
210 215 220

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Asn Glu Leu Lys
225 230 235 240

Glu Arg Asn Ser Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser
245 250 255

Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly Leu
260 265 270

Arg Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu His Leu Gly
275 280 285

Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys Val Ile Pro Met
290 295 300

Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly Ile Gly Gly
305 310 315 320

Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr
325 330 335

Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile Tyr Leu Gly Ser
340 345 350

Glu Ser Ser Gly Glu Leu Ser Ala Glu Val Trp Lys Asp Leu Trp Pro
355 360 365

Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe Cys Phe Gly Met Asp Ile
370 375 380

Leu Leu Lys Leu Asp Leu Pro Ala Thr Arg Arg Phe Phe Asp Ala Phe
385 390 395 400

Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu
405 410 415

Phe Leu Pro Glu Leu Ile Val Phe Gly Leu Ser Leu Phe Ser His Ala
420 425 430

Ser Asn Thr Ser Arg Glu Ile Met Thr Lys Gly Thr Pro Leu Val Met
435 440 445

Ile Asn Asn Leu Leu Gln Asp Glu
450 455

<210> 21 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 21 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr
1 5 10 15

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr
20 25 30

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly
35 40 45

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe
50 55 60

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe
65 70 75 80

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val
85 90 95

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val
100 105 110

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala
115 120 125

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr
130 135 140

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln
145 150 155 160

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp
165 170 175

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg
180 185 190

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser
195 200 205

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu
210 215 220

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala
225 230 235 240

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val
245 250 255

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu
260 265 270

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr
275 280 285

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro
290 295 300

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu
305 310 315 320

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys
325 330 335

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys
340 345 350

Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro
355 360 365

Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val
370 375 380

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro
385 390 395 400

Lys Tyr Ala Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys
405 410 415

Gln Ile Asn Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro
420 425 430

Pro Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu
435 440 445

Ile Val Gln Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe
450 455 460

Phe Arg Leu Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu
465 470 475 480

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser
485 490 495

Pro Asn Asn Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro
500 505 510

Thr Gly Ala Thr Met Ile Lys Thr Tyr Leu Lys Val
515 520

<210> 22 
<211> 1898 
<212> DNA 

<213> Adonis palaestina
<400> 22

AAAGGAGTGT TCTATTAATG TTACTGTCGC ATTCTTGCAA CACTTATATT CAAACTCCAT 60
TTTCTTCTTT TCTCTTCAAA ACAACAAACT AATGTGAGCA GAGTATCTGG CTATGGAACT 120
ACTTGGTGTT CGCAACCTCA TCTCTTCTTG CCCTGTGTGG ACTTTTGGAA CAAGAAACCT 180
TAGTAGTTCA AAACTAGCTT ATAACATACA TCGATATGGT TCTTCTTGTA GAGTAGATTT 240
TCAAGTGAGA GCTGATGGTG GAAGCGGGAG TAGAAGTTCT GTTGCTTATA AAGAGGGTTT 300
TGTGGATGAA GAGGATTTTA TCAAAGCTGG TGGTTCTGAG CTTTTGTTTG TCCAAATGCA 360
GCAAACAAAG TCTATGGAGA AACAGGCCAA GCTCGCCGAT AAGTTGCCAC CAATACCTTT 420
TGGAGAATCC GTGATGGACT TGGTTGTAAT AGGTTGTGGA CCTGCTGGTC TTTCACTGGC 480
TGCAGAAGCT GCTAAGCTAG GGTTGAAAGT TGGCCTTATT GGTCCTGATC TTCCTTTTAC 540
AAATAATTAT GGTGTGTGGG AAGACGAGTT CAAAGATCTT GGACTTGAAC GTTGTATCGA 600
GCATGCTTGG AAGGACACCA TCGTATATCT TGATAATGAT GCTCCTGTCC TTATTGGTCG 660
TGCATATGGA CGAGTTAGTC GACATTTGCT ACATGAGGAG TTGCTGAAAA GGTGTGTGGA 720
GTCAGGTGTA TCATATCTTG ATTCTAAAGT GGAAAGGATC ACTGAAGCTG GTGATGGCCA 780
TAGCCTTGTA GTTTGTGAAA ATGAGATCTT TATCCCTTGC AGGCTTGCTA CTGTTGCATC 840
TGGAGCAGCT TCAGGGAAAC TTTTGGAGTA TGAAGTAGGT GGCCCTCGTG TTTGTGTCCA 900
AACCGCTTAT GGGGTGGAGG TTGAGGTGGA GAACAATCCA TACGATCCCA ACTTAATGGT 960
ATTCATGGAC TACAGAGACT ATATGCAACA GAAATTACAG TGCTCGGAAG AAGAATATCC 1020
AACATTTCTC TATGTCATGC CCATGTCGCC AACAAGACTT TTTTTTGAGG AAACCTGTTT 1080
GGCCTCAAAA GATGCCATGC CATTCGATCT ACTGAAGAGA AAACTGATGT CACGATTGAA 1140
GACTCTGGGT ATCCAAGTTA CAAAAGTTTA TGAAGAGGAA TGGTCATATA TTCCTGTTGG 1200
TGGTTCTTTA CCAAACACAG AGCAAAAGAA CCTAGCATTT GGTGCTGCAG CAAGCATGGT 1260
GCATCCAGCA ACAGGCTATT CGGTTGTACG GTCACTGTCA GAAGCTCCAA AATATGCTTC 1320
TGTAATTGCA AAGATTTTGA AGCAAGATAA CTCTGCGTAT GTGGTTTCTG GACAAAGTAG 1380
TGCAGTAAAC ATTTCAATGC AAGCATGGAG CAGTCTTTGG CCAAAGGAGC GAAAACGTCA 1440
AAGAGCATTC TTTCTTTTTG GATTAGAGCT TATTGTGCAG CTAGATATTG AAGCAACCAG 1500
AACATTCTTT AGAACCTTCT TCCGCTTGCC AACTTGGATG TGGTGGGGTT TCCTTGGGTC 1560
TTCACTATCA TCTTTCGATC TCGTCTTGTT TTCCATGTAC ATGTTTGTTT TGGCGCCAAA 1620
CAGCATGAGG ATGTCACTTG TGAGACATTT GCTTTCAGAT CCTTCTGGTG CAGTTATGGT 1680
AAGAGCTTAC CTCGAAAGGT AGTCTCATCT ATTATTAAAC TCTAGTGTTT CACCAAATAA 1740
ATGAGGATCC TTCGAATGTG TATATGATCA TCTCTATGTA TATCCTGTAC TCTAATCTCA 1800
TAAAGTAAAT GCCGGGTTTG ATATTGTTGT GTCAAACCGG CCAATGATAT AAAGTAAATT 1860
TATTGATACA AAAGTAGTTT TTTTCCTTAA AAAAAAAA 1898

<210> 23 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 23 

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu
515 520 525

Arg 

<210> 24
<211> 1370
<212> DNA

<213> Potato
<400> 24

TAGCGGAGGA TGAGTTCAAA GATCTTGGTC TTCAAGCCTG CATTGAACAT GTTTGGCTGG 60
GATACCATTG TATATCTTGA TGATGATGAT CCTATTCTTA TTGGCCGTGC CTATGGAAGA 120
GTTAGTCGCC ATTTACTGCA CGAGGAGTTA CTCAAAAGGT GTGTGGAGGC AGGTGTTTTG 180
TATCTAAACT CGAAAGTGGA TAGGATTGTT GAGGCCACAA ATGGCCACAG TCTTGTAGAG 240
TGCGAGGGTG ATGTTGTGAT TCCCTGCAGG TTTGTGACTG TTGCATCGGG AGCAGCCTCG 300
GGGAAATTCT TGCAGTATGA GTTGGGAGGT CCTAGAGTTT CTGTTCAAAC AGCTTATGGA 360
GTGGAAGTTG AGGTCGATAA CAATCCATTT GACCCGAGCC TGATGGTTTT CATGGATTAT 420
AGAGACTATG TCAGACACGA CGCTCAATCT TTAGAAGCTA AATATCCAAC ATTTCTCTAT 480
GCCATGCCCA TGTCTCCAAC ACGAGTCTTT TTCGAGGAAA CTTGTTTGGC TTCAAAAGAT 540
GCAATGCCAT TCGATCTGTT AAAGAAAAAA TTGATGTTAC GATTGAACAC CCTCGGTGTA 600
AGAATTAAAG AAATTTATGA GGAGGAATGG TCTTACATAC CAGTTGGAGG ATCTTTGCCA 660
AATACAGAAC AAAAAACACT TGCATTTGGT GCTGCTGCTA GCATGGTTCA TCCAGCCACA 720
GGTTATTCAG TCGTCAGATC ACTGTCTGAA GCTCCAAAAT GCGCCTTCGT GCTTGCAAAT 780
ATATTACGAC AAAATCATAG CAAGAATATG CTTACTAGTT CAAGTACCCC GAGTATTTCA 840
ACTCAAGCTT GGAACACTCT TTGGCCACAA GAACGAAAAC GACAAAGATC GTTTTTCCTA 900
TTTGGACTGG CTCTGATATT GCAGCTGGAT ATTGAGGGGA TAAGGTCATT TTTCCGCGCG 960
TTCTTCCGTG TGCCAAAATG GATGTGGCAG GGATTTCTTG GTTCAAGTCT TTCTTAGCAG 1020
ACCTCATGTT ATTTGCCTTC TACATGTTTA TTATTGCACC AAATGACATG AGAAGAGGCT 1080
TAATCAGACA TCTTTTATCT GATCCTACTG GTGCAACATT GATAAGAACT TATCTTACAT 1140
TTTAGAGTAA ATTCCTCCTA CAATAGTTGT TGAAAGAGGC CTCATTACTT CAGATTCATA 1200
ACAGAAATCG CGGTCTCTCG AGGCCTTGTA TATAACATTT TCACTAGGTT AATATTGCTT 1260
GAATAAGTTG CACAGTTTCA GTTTTTGTAT CTGCTTCTTT TTTGTCCAAG ATCATGTATT 1320
GACCAATTTA TATACATTGC CAGTATATAT AAATTTTATA AAAAAAAAAA 1370

<210> 25

<211> 377

<212> PRT

<213> Potato

<400> 25

Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile Glu His Val Trp
1 5 10 15

Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Asp Pro Ile Leu Ile Gly
20 25 30

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu
35 40 45

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp
50 55 60

Arg Ile Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly
65 70 75 80

Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala
85 90 95

Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Gly Pro Arg Val Ser Val
100 105 110

Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp
115 120 125

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp
130 135 140

Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro
145 150 155 160

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys
165 170 175

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu
180 185 190

Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Glu Glu Glu Trp Ser
195 200 205

Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu
210 215 220

Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser
225 230 235 240

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala
245 250 255

Asn Ile Leu Arg Gln Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser
260 265 270

Thr Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu
275 280 285

Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu
290 295 300

Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg
305 310 315 320

Val Pro Lys Met Met Trp Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala
325 330 335

Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn Asp
340 345 350

Met Arg Arg Gly Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly Ala
355 360 365

Thr Leu Ile Arg Thr Tyr Leu Thr Phe
370 375

<210> 26

<211> 533

<212> PRT

<213> Chimeric lettuce/potato

<400> 26

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe
1 5 10 15

Thr Cys Pro Arg Phe Thr Asp Cys Asn Ile Arg His Lys Phe Ser Leu
20 25 30

Leu Lys Gly Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arg
35 40 45

Gln Ile Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gln
50 55 60

Gly Ile Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser
65 70 75 80

Glu Leu Phe Phe Val Gln Met Gln Arg Thr Lys Ser Met Glu Ser Gln
85 90 95

Ser Lys Leu Ser Glu Lys Leu Ala Gln Ile Pro Ile Gly Asn Cys Ile
100 105 110

Leu Asp Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala
115 120 125

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu Ile Gly Pro Asp
130 135 140

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gln Asp Glu Phe Ile Gly
145 150 155 160

Leu Gly Leu Glu Gly Cys Ile Glu His Ser Trp Lys Asp Thr Leu Val
165 170 175

Tyr Leu Asp Asp Ala Asp Pro Ile Arg Ile Gly Arg Ala Tyr Gly Arg
180 185 190

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu
195 200 205

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala
210 215 220

Pro Asn Gly Tyr Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro
225 230 235 240

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu
245 250 255

Glu Tyr Glu Leu Gly Gly Pro Arg Val Ser Val Gln Thr Ala Tyr Gly
260 265 270

Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp Pro Ser Leu Met Val
275 280 285

Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp Ala Gln Ser Leu Glu
290 295 300

Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro Met Ser Pro Thr Arg
305 310 315 320

Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe
325 330 335

Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu Asn Thr Leu Gly Val
340 345 350

Arg Ile Lys Glu Ile Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly
355 360 365

Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu Ala Phe Gly Ala Ala
370 375 380

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu
385 390 395 400

Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala Asn Ile Leu Arg Gln
405 410 415

Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser Thr Pro Ser Ile Ser
420 425 430

Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu Arg Lys Arg Gln Arg
435 440 445

Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu Gln Leu Asp Ile Glu
450 455 460

Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg Val Pro Lys Trp Met
465 470 475 480

Trp Gln Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala Asp Leu Met Leu
485 490 495

Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn Asp Met Arg Arg Gly
500 505 510

Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Leu Ile Arg
515 520 525

Thr Tyr Leu Thr Phe
530

<210> 27

<211> 374
<212> PRT

<213> Arabidopsis thaliana
<400> 27

Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln Lys Cys Ile Glu His Val
1 5 10 15

Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp Asp Lys Pro Ile Thr Ile
20 25 30

Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His Glu Glu Leu
35 40 45

Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser Ser Lys Val
50 55 60

Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu Arg Leu Val Ala Cys Asp
65 70 75 80

Asp Asn Asn Val Ile Pro Cys Arg Leu Ala Thr Val Ala Ser Gly Ala
85 90 95

Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val Gly Gly Pro Arg Val Cys
100 105 110

Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn Ser Pro Tyr
115 120 125

Asp Pro Asp Gln Met Val Phe Met Asp Tyr Arg Asp Tyr Thr Asn Glu
130 135 140

Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu Tyr Ala Met
145 150 155 160

Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys Leu Ala Ser
165 170 175

Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu Met Leu Arg
180 185 190

Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys Thr Tyr Glu Glu Glu Trp
195 200 205

Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Asn
210 215 220

Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr
225 230 235 240

Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala Ser Val Ile
245 250 255

Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys Gln Ile Asn Ser Asn Ile
260 265 270

Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg Lys Arg Gln
275 280 285

Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu Ile Val Gln Phe Asp Thr
290 295 300

Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp
305 310 315 320

Met Trp Gln Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly Asp Leu Val
325 330 335

Leu Phe Ala Leu Tyr Met Phe Val Ile Ser Pro Asn Asn Leu Arg Lys
340 345 350

Gly Leu Ile Asn His Leu Ile Ser Asp Pro Thr Gly Ala Thr Met Ile
355 360 365

Lys Thr Tyr Leu Lys Val
370

<210> 28
<211> 1002
<212> DNA
<213> Adonis palaestina

<400> 28

ATTCATCTTC AGCAGCGCTG TCGTACTCTT TCTATATCTT CTTCCATCAC TAACAGTAGT 60

CGCCGACGGT TGAATCGGCT ATTCGCCTCA ACGTCAACTA TGGGTGAAGT CACTGATGCT 120

GGAATGGATG CTGTTCAGAA GCGGCTCATG TTCGACGACG AATGTATTTT GGTGGATGAG 180

AATGACAAGG TCGTCGGGCA TGATTCCAAA TACAACTGTC ATTTGATGGA AAAGATAGAG 240

GCAGAAAATT TGCTTCACAG AGCCTTCAGT GTTTTCTTGT TCAACTCAAA ATATGAATTG 300

CTTCTTCAGC AACGATCCGC CACAAAGGTA ACATTCCCGC TCGTATGGAC AAACACATGT 360

TGCAGTCATC CTCTCTTTCG TGATTCCGAG CTCATAGAAG AAAATTATCT CGGTGTACGA 420

AACGCTGCAC AAAGAAAGCT TTTAGACGAG CTAGGCATTC CAGCTGAAGA TGTCCCAGTT 480

GATGAATTTA CTCCTCTTGG TCGCATTCTT TACAAAGCTC CATCTGACGG CAAATGGGGA 540

GAGCACGAAT TGGACTATCT CCTATTTATT GTCCGAGATG TGAAATACGA TCCAAACCCA 600

GATGAAGTTG CTGATGCTAA GTATGTTAAT CGCGAGGAGT TGAGAGAGAT ACTGAGAAAA 660

GCTGATGCTG GTGAAGAGGG ACTCAAGTTG TCTCCTTGGT TTAGATTGGT TGTTGATAAC 720

TTTTTGTTCA AGTGGTGGGA TCATGTAGAG CAGGGTACGA TTAAGGAAGT TGCTGACATG 780

AAAACTATCC ACAAGTTGAC TTAAGAGGAC TTCTCTCCTC TGTTCTACTA TTTGTTTTTT 840

GCTACAATAA GTGGGTGGTG ATAAGCAGTT TTTCTGTTTT CTTTAATTTA TGGCTTTTGA 900

ATTTGCCTCG ATGTTGAACT TGTAACATAT TTAGACAAAT ATGAGACCTT GTAAGTTGAA 960

TTTGAGGCTG AATTTATATT TTTGGGAACA TAATAATGTT AA 1002
<210> 29

<211> 1271
<212> DNA

<213> Adonis palaestina
<400> 29

TTTTAAAGCT CTTTCGCTCC ACCACCATCA AAGCCAGCCA AATTTCTCTG TACAAAAGTT 60

AAAAACACCG CTTTGGGCTT TGGCCCCTCC ATATCGGAAT CCTTGTTTAC GATACGCATC 120

TAAACCAGTA ATTCTCGGTT TTAATTTGTT TCCTAAATTA GGCCCCTTTC CGGAATCCCG 180

AGAATTATGT CGTCGATCAG GATTAATCCT TTATATAGTA TCTTCTCCAC CACCACTAAA 240

ACATTATCAG CTTCGTGTTC TTCTCCCGCT GTTCATCTTC AGCAGCGTTG TACGTACTCT 300

TTCTATTTCT TCTTCCATCA CTAACAGTCC TCGCCGAGGG TTGAATCGGC TGTTCGCCTC 360

AACGTCGACT ATGGGTGAAG TCGCTGATGC TGGTATGGAT GCCGTCCAGA AGCGGCTTAT 420

GTTCGACGAT GAATGTATTT TGGTGGATGA GAATGACAAG GTCGTCGGAC ATGATTCCAA 480

ATACAACTGT CATTTGATGG AAAAGATAGA GGCAGAAAAC TTGCTTCACA GAGCCTTCAG 540

TGTTTTCTTA TTCAACTCAA AATACGAGTT GCTTCTTCAG CAACGATCTG CAACGAAGGT 600

AACATTCCCG CTCGTATGGA CAAACACCTG TTGCAGCCAT CCCCTCTTCC GTGATTCCGA 660

ACTCATAGAA GAAAATTTTC TCGGGGTACG AAACGCTGCA CAAAGGAAGC TTTTAGACGA 720

GCTAGGCATT CCAGCTGAAG ACGTACCAGT TGATGAATTC ACTCCTCTTG GTCGCATTCT 780

TTACAAAGCT CCATCTGACG GAAAATGGGG AGAGCACGAA CTGGACTATC TTCTGTTTAT 840

TGTCCGAGAT GTGAAATACG ATCCAAACCC AGATGAAGTT GCTGACGCTA AGTACGTTAA 900

TCGCGAGGAG TTGAAAGAGA TACTGAGAAA AGCTGATGCA GGTGAAGAGG GAATAAAGTT 960

GTCTCCTTGG TTTAGATTGG TTGTGGATAA CTTTTTGTTC AAGTGGTGGG ATCATGTAGA 1020

GGAGGGGAAG ATTAAGGACG TCGCCGACAT GAAAACTATC CACAAGTTGA CTTAAGAGAA 1080

AGTCTCTTAA GTTCTACTAT TTGGTTTTTG CTTCAATAAG TGGATGGTGA TGAGCAGTTT 1140

TTATGCTTCC TTTAATTTTG GCTTTTCAAT TTGCTTTATG TGTTGAACTT GTAACATATT 1200

TAGTCAAATA TGAGACCTTG TGAGTTGAAT TTGAGGTTAT ATTTATAGTT TTGGGAACAT 1260

PCT/US99/12121

1271

<210> 30
<211> 1109
<212> DNA

<213> Haematococcus pluvialis
<400> 30

TGGAACCTGG CCCGGCGGCA GTCCGATGCC GCGATGCTTC GTTCGTTGCT CAGAGGCCTC 60

ACGCATATCC CGCGCGTGAA CTCCGCCCAG CAGCCCAGCT GTGCACACGC GCGACTCCAG 120

TTTAAGCTCA GGAGCATGCA GCTGCTTGCC GAGGACCGCA CAGACCACAT GAGGGGTGCA 180

AGCACCTGGG CAGGCGGGCA GTCGCAGGAT GAGCTGATGC TGAAGGACGA GTGCATCTTA 240

GTGGATGCTG ACGACAACAT CACAGGCCAT GCCAGCAAGC TGGAGTGCCA CAAATTCCTA 300

CCACATCAGC CTGCAGGCCT GCTGCACCGG GCCTTCTCTG TGTTCCTGTT TGACGACCAG 360

GGGCGACTGC TGCTGCAACA GCGTGCACGC TCAAAAATCA CCTTCCCAAG TGTGTGGACG 420

AACACCTGCT GCAGCCACCC TCTACATGGG CAGACCCCAG ATGAGGTGGA CCAACTAAGC 480

CAGGTGGCCG ACGGCACAGT ACCTGGCGCA AAAGCTGCTG CCATCCGCAA GTTGGAGCAC 540

GAGCTGGGGA TACCAGCGCA CCAGCTGCCG GCAAGCGCGT TTCGCTTCCT CACGCGTTTG 600

CACTACTGTG CCGCGGACGT GCAGCCGGCT GCGACACAAT CAGCGCTCTG GGGCGAGCAC 660

GAGATGGACT ACATCTTATT CATCCGGGCC AACGTCACCT TGGCGCCCAA CCCTGACGAG 720

GTGGACGAAG TCAGGTACGT GACGCAAGAG GAGCTGCGGC AGATGATGCA GCCGGACAAC 780

GGGTTGCAAT GGTCGCCGTG GTTTCGCATC ATCGCCGCGC GCTTCCTTGA GCGTTGGTGG 840

GCTGACCTGG ACGCGGCCCT AAACACTGAC AAACACGAGG ATTGGGGAAC GGTGCATCAC 900

ATCAACGAAG CGTGAAGGCA GAAGCTGCAG GATGTGAAGA CACGTCATGG GGTGGAATTG 960

CGTACTTGGC AGCTTCGTAT CTCCTTTTTC TGAGACTGAA CCTGCAGAGC TAGAGTCAAT 1020

GGTGCATCAT ATTCATCGTC TCTCTTTTGT TTTAGACTAA TCTGTAGCTA GAGTCACTGA 1080

TGAATCCTTT ACAACTTTCA AAAAAAAAA 1109

<210> 31
<211> 985
<212> DNA

<213> Lactuca sativa
<400> 31

TGCCAAAATG TTGAAATTTC CCCCTTTTAA AACCATTGCT ACCATGATCT CTTCTCCATA 60

TTCTTCCTTC TTGCTGCCTC GGAAATCTTC TTTCCCTCCA ATGCCGTCTC TCGCAGCCGC 120

TAGTGTTTTC CTCCACCCTC TTTCGTCTGC CGCTATGGGC GATTCCAGCA TGGATGCTGT 180

CCAGCGACGT CTCATGTTCG ATGACGAATG CATTTTGGTG GATGAGAATG ACAAAGTGGT 240

TGGCCATGAT ACTAAATACA ATTGTCATTT GATGGAGAAG ATTGAAAAGG GAAATATGCT 300
ACACAGAGCA TTCAGTGTGT TCTTGTTCAA CTCGAAATAT GAATTACTCC TTCAGCAACG 360

TTCTGCAACC AAGGTGACTT TCCCTTTGGT ATGGACAAAC ACGTGTTGCA GCCATCCACT 420

ATACAGGGAG AGTGAGCTTA TTGACGAAAA CGCCCTTGGG GTGAGGAATG CTGCACAGAG 480

GAAGCTCCTG GATGAACTCG GCATCCCTGG AGCAGATGTT CCGGTTGATG AGTTCACTCC 540

ATTGGGTCGC ATTCTATACA AGGCCGCATC GGATGGAAAG TGGGGAGAAC ATGAACTTGA 600

TTACCTGCTG TTTATGGTAC GTGATGTTGG TTTGGATCCG AACCCAGATG AAGTGAAAGA 660

TGTAAAATAT GTGAACCGGG AAGAGCTGAA GGAATTGGTA AGGAAGGCGG ATGCTGGTGA 720

AGAGGGTGTG AAGCTGTCCC CGTGGTTCAA ATTGATTGTC GATAATTTCT TGTTTCAGTG 780

GTGGGATCGA CTCCATAAGG GAACCCTAAC CGAAGCTATT GATATGAAAA CAATCCACAA 840

ACTCACATAA AAACACTACA CTAGTAGGAG AGAGGATTAT ATGAGATATT TGTTATATGT 900

GAAATTGAAA TTCAGATGAA TGCTTGTATT TATTTCTATT TGGACAAACT TCAACTTCTT 960

TTTGCTACCT TATCAGAAAA AAAAA 985


<210> 32 
<211> 988 
<212> DNA 

<213> Lactuca sativa

<400> 32

TATTCGCTTC AAAATCTCTT CCATTAACTG CTCAAATCTC CACCTTCGCC GGTCTTAATC 60

TCCGCCGGCG CACTTTCACC ACCATAACCG CCGCCATGGG TGACGATTCC GGCATGGACG 120

CTGTCCAGAG ACGTCTCATG TTTGATGATG AATGCATTTT GGTTGATGAA AATGACAATG 180

TTCTTGGGCA TGATACCAAA TACAATTGTC ACTTGATGGA GAAGATTGAG AAAGATAATT 240

TGCTTCATAG AGCATTCAGT GTATTTTTAT TCAATTCAAA ATACGAATTA CTCCTTCAGC 300

AAAGGTCAGA AACCAAGGTG ACATTTCCTT TGGTATGGAC AAACACCTGT TGCAGCCATC 360

CACTATACAG AGAATCGGAG TTAATTCCCG AAAATGCCCT TGGGGTCAGA AATGCTGCAC 420

AGAGGAAGCT TCTAGATGAA CTCGGTATCC CTGCTGAAGA TGTTCCAGTT GATGAGTTCA 480

CAACTTTAGG TCGCATGTTG TACAAGGCTC CATCTGATGG AAAATGGGGT GAACATGAAG 540

TTGATTACCT ACTCTTCCTC GTGCGTGACG TTGCCGTGAA CCCAAACCCT GATGAGGTGG 600

CGGACATTAG ATACGTGAAC CAAGAAGAGT TAAAAGAGTT ACTAAGGAAG GCGGATGCGG 660

GTGAGGAGGG TTTGAAATTG TCCCCATGGT TTAGGCTAGT GGTGGACAAC TTCTTGTTCA 720

AATGGTGGGA TCATGTCCAA AAGGGGACAC TCAATGAAGC AATTGACATG AAAACCATTC 780

ATAAGTTGAT ATGAAAAATG GTTAATATTT ATGGTGGTGG TTTGGAGCTA ATAATTTGTG 840

TGTTCAAGTC TCGGTCCTTC TTTTTTTAAC GTTTTTTTTT TTTCTTTTAT TGGGAGTGTT 900

TATTGTGTAC TTGTAACGTA GGCCCTTTGG TTACGCTTTA AGAGTTTAAT AAAGAACCAC 960

CGTTAATTTA AAAAAAAAAA AAAAAAAA 988


<210> 33
<211> 1874
<212> DNA

<213> Chlamydomonas reinhardtii

<400> 33

GGCACGAGCT CGAGTTTGTT TTACCATGAC ATCGGGAATT TGGAAGCTTG AACTACCTCA 60

ATTACTCAAG TAACTCGCGG CAACACATTT CGCGCGCCAT CGCTGTTTTC TCTGCTCCAG 120

CTACCGAGCA GCATTGCTTT AGATCGCTTT GATGTCATAA ACTCCCACTT ATATGAGATC 180

CAGTTTCATC GAGCCCAAGC CCAGAGCGCA ACCTGTCTTA AGCCGCGGCA GGGCGTCCAT 240

GCGCCTCGCG CAAAGCCGTG CTCTCGTTGC GCGTGTCAGC TCCGCCCTGT GGCCGGGAGC 300

AGGACTTTCA CAGGCTCAAA GCGTTGCGGT GCGAATGGCG AGTTCGTCAA CCTGGGAAGG 360

CACGGGCCTG AGCCAGGATG ACTTCATGCA GCGGGACGAG TGCTTGGTGG TGGACGAGCA 420

GGACCGGCTG CTAGGCACCG CCAACAAGTA CGACTGCCAC CGCTTCGAGG CGGCCAAGGG 480

CCAGCCCTGC GGCCGCCTGC ACCGCGCCTT CTCCGTGTTC CTGTTCAGCC CCGACGGCCG 540

ACTGCTGCTG CAGCAGCGCG CAGCCAGCAA GGTGACGTTC CCGGGTGTGT GGACCAACAC 600

CTGCTGCTCG CACCCGCTGG CGGGCCAGGC GCCGGACGAG GTGGACCTGC CGGCGGCGGT 660

AGCCTCGGGC CAGGTGCCGG GCATCAAGGC GGCGGCGGTG CGCAAGCTGC AGCACGAGCT 720

GGGGATACCG CCGGAGCAGG TTCCCGCCTC CTCCTTCTCC TTCCTCACGC GTCTGCACTA 780

CTGCGCCGCC GACACCGCCA CGCACGGCCC GGCGGCGGAG TGGGGCGAGC ACGAGGTGGA 840

CTACGTGCTG TTCGTGCGGC CGCAGCAGCC CGTCAGCCTG CAGCCCAACC CAGACGAGGT 900

GGACGCCACG CGCTACGTGA CGCTGCCGGA GCTTCAGTCC ATGATGGCGG ACCCCGGCCT 960

CAGCTGGAGC CCCTGGTTCC GCATCCTGGC CACACAGCCC GCCTTCCTGC CCGCCTGGTG 1020

GGGCGACCTG AAGCGGCGCT GGCGCCCGGG CGGCAGCCGA CTGTCGGACT GGGGCACCAT 1080

CCACCGCGTC ATGTGAAGAA AAAGGGGAAG CAGGGGCGGG AGCGGGGGAT GAATGGGAAT 1140

GTGAATGCGA TTGTGATGCG GCGTGGGATG AGGTCTGAAG ACAGGGGGAA AATCGGGGGG 1200

CGGGCGTGAG CGTGTGTGTA CGTGAGCGAC AAAGCCGGGA GGCGGACCGC GCGATGGGTA 1260

CATGTGTGTG CGGAGGGTCG GTGGGTCGGT CGGTTGCGCG GCATAGCGTG TTGTGTGTGT 1320

GCGGCTGCAG GGGTATGTGG GCACCCGGGC ACGGAGGAGA AGGCACACGC AGGTGGCGCG 1380

GAGGTGTGTC AGGGGCCATG GGCGGGCCTC ACTCCTGGTC GTGCCCAGTG GTCTCGTGGG 1440

CAGAGTGGCA GGGGCTGCAC CCATATGAGC GGCGCACTGC CGCGCTGGGC TAAGTCCTTA 1500

TCACTTGGTG AGGTGGGGCG AGGTGGCTGT GGGCGGCGGG CGCAGTGGCA GAAGGACACG 1560

GTGTGTGAGC GGTGGAGCTC TGGCCGTGCC GGCCGTGAGG GGCGGATAGC GATATGACGT 1620

TGTGCTTGGC CGCTGTAATG CGGGAGAATG TGCAGGCCGC GAGAAGCGGG CGGTGGCAGG 1680

AGGCCGCAGG CTGCAGCACC CGTTGGGGAG GTGCCACCTG CAGGCGCGGC GCCGGGCGGG 1740

CCTGAGTAAT GGGCGCCTGA GTAGTGGCGG CCACAGGAGG CGCAGGAGGC AGCAGCAGGA 1800

GGACGAGCTG GAGGGACCCG TTGGCAACCC AAGGTTGCGC GTGTAACATA GTGGCCATAC 1860

AAAAAAAAAA AAAA 1874

<210> 34
<211> 954
<212> DNA

<213> Tagetes erecta
<400> 34

CCAAAAACAA CTCAAATCTC CTCCGTCGCT CTTACTCCGC CATGGGTGAC GACTCCGGCA 60

TGGATGCTGT TCAGCGACGT CTCATGTTTG ACGATGAATG GATTTTGGTG GATGAGTGTG 120

ACAATGTGGT GGGACATGAT ACCAAATACA ATTGTCACTT GATGGAGAAG ATTGAAACAG 180

GTAAAATGCT GCACAGAGCA TTCAGCGTTT TTCTATTCAA TTCAAAATAC GAGTTACTTC 240

TTCAGCAACG GTCTGCAACC AAGGTGACAT TTCCTTTAGT ATGGACCAAC ACCTGTTGCA 300

GCCATCCACT CTACAGAGAA TCCGAGCTTG TTCCCGAAAA CGCCCTTGGA GTAAGAAATG 360

CTGCACAGAG GAAGCTGTTG GATGAACTCG GTATCCCTGC TGAAGATGTT CCCGTTGATC 420

AGTTTACTCC TTTAGGTCGC ATGCTCTACA AGGCTCCATC TGATGGAAAG TGGGGAGAAC 480

ATGAACTTGA CTACCTACTT TTCATAGTGA GAGACGTTGC TGTAAACCCG AACCCAGATG 540

AAGTGGCGGA TATCAAATAT GTGACCAGAA GAGTTAAAGG AGCTGCTAAG GAAAGCAGAT 600

GCGGGGGAGG AGGGTTTGAA GCTGTCTCCA TGGTTCAGGT TAGTGGTTGA TAACTTCTTG 660

TTCAAGTGGT GGGATCATGT GCAAAAGGGT ACACTCACTG AAGGAATTGA TATGAAAACC 720

ATACACAAGC TGATATAGAA ACACACCCTC AACCGAAAAG TTCAAGCCTA ATAATTCGGG 780

TTGGGTCGGG TCTACCATCA ATTGTTTTTT TCTTTTAAGA AGTTTTAATC TCTATTTGAG 840

CATGTTGATT CTTGTCTTTT GTGTGTAAGA TTTTGGGTTT CGTTTCAGTT GTAATAATGA 900

ACCATTGATG GTTTGCAATT TCAAGTTCCT ATCGACATGT AGTGATCTAA AAAA 954

<210> 35 
<211> 1031 
<212> DNA 

<213> Oryza sativa 
<400> 35 

CCTCCCTTTG CCTCGCGCAG AGGCGGCCGC GCCTTCTCCG CCGCGAGGAT GGCCGGCGCC 60 

GCCGCCGCCG TGGAGGACGC CGGGATGGAC GAGGTCCAGA AGCGGCTCAT GTTCGACGAC 120 

GAATGCATTT TGGTGGATGA ACAAGACAAT GTTGTTGGCC ATGAATCAAA ATATAACTGC 180 

CATCTGATGG AAAAAATCGA ATCTGAAAAT CTACTTCATA GGGCTTTCAG TGTATTCCTG 240

TTCAACTCAA AATATGAACT CCTACTCCAG CAACGATCTG CAACAAAGGT TACATTTCCT 300

CTAGTTTGGA CCAACACTTG CTGCAGCCAT CCTCTGTACC GTGAGTCTGA GCTTATACAG 360

GAAAACTACC TTGGTGTTAG AAATGCTGCT CAGAGGAAGC TCTTGGATGA GCTGGGCATC 420

CCAGCTGAAG ATGTGCCAGT TGACCAATTC ACCCCTCTTG GTCGGATGCT TTACAAGGCC 480

CCATCTGATG GAAAATGGGG TGAACACGAG CTTGACTACC TGCTGTTCAT CGTCCGCGAC 540

GTGAAGGTAG TCCCGAACCC GGACGAAGTG GCCGATGTGA AATACGTGAG CCGTGAGCAG 600

CTGAAGGAGC TCATCCGCAA AGCGGACGCC GGAGAGGAAG GCCTGAAGCT GTCTCCCTGG 660

TTCCGGCTGG TTGTTGACAA CTTCCTCATG GGCTGGTGGG ATCACGTCGA GAAAGGCACC 720

CTCAACGAGG CCGTGGACAT GGAGACCATC CACAAGCTGA AGTAAGGACT GCGATGTTGT 780

GGCTGGAAAG AATGATCCTG AAGACTCTGT TCTTGTGCTG CTGCATATTA CTCTTACCAG 840


rzrzjmrzTTn.rix 

uonnvj 1 1 Ij^M 




O/T-rtOV— 1 1 1 1 o 




UVj 1 1 i OOnvjU 




on n 
yuu 


TGGGCTCTGC TGACTGAGAG ATTCCCTTAT AGAGTGTCTA TGTTAATTTA GCAAACTTCT 960

ATATTATACA TGATTAGTTA ATTGTTCGGT GTCTGAATAA AGAACAATAG CATGTTCCAT 1020

GTTTATTTGC T 1031



<210> 36 
<211> 232 
<212> PRT 

<213> Tagetes erecta 
<400> 36 

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe
1 5 10 15

Asp Asp Glu Cys Ile Leu Val Asp Glu Cys Asp Asn Val Val Gly His
20 25 30

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Thr Gly Lys
35 40 45

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
50 55 60

Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val
65 70 75 80

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
85 90 95

Val Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
100 105 110

Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Gln Phe
115 120 125

Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp
130 135 140

Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Ala
145 150 155 160

Val Asn Pro Asn Pro Asp Glu Val Ala Asp Ile Lys Tyr Val Ser His
165 170 175

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly
180 185 190

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe
195 200 205

Lys Trp Trp Asp His Val Gln Lys Gly Thr Leu Thr Glu Ala Ile Asp
210 215 220

Met Lys Thr Ile His Lys Leu Ile
225 230

<210> 37

<211> 280
<212> PRT

<213> Lactuca sativa

<400> 37

Met Leu Lys Phe Pro Pro Phe Lys Thr Ile Ala Thr Met Ile Ser Ser
1 5 10 15

Pro Tyr Ser Ser Phe Leu Leu Pro Arg Lys Ser Ser Phe Pro Pro Met
20 25 30

Pro Ser Leu Ala Ala Ala Ser Val Phe Leu His Pro Leu Ser Ser Ala
35 40 45

Ala Met Gly Asp Ser Ser Met Asp Ala Val Gln Arg Arg Leu Met Phe
50 55 60

Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val Gly His
65 70 75 80

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Lys Gly Asn
85 90 95

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
100 105 110

Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val
115 120 125

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
130 135 140

Ile Asp Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
145 150 155 160

Leu Asp Glu Leu Gly Ile Pro Gly Ala Asp Val Pro Val Asp Glu Phe
165 170 175

Thr Pro Leu Gly Arg Ile Leu Tyr Lys Ala Ala Ser Asp Gly Lys Trp
180 185 190

Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Met Val Arg Asp Val Gly
195 200 205

Leu Asp Pro Asn Pro Asp Glu Val Lys Asp Val Lys Tyr Val Asn Arg
210 215 220

Glu Glu Leu Lys Glu Leu Val Arg Lys Ala Asp Ala Gly Glu Glu Gly
225 230 235 240

Val Lys Leu Ser Pro Trp Phe Lys Leu Ile Val Asp Asn Phe Leu Phe
245 250 255

Gln Trp Trp Asp Arg Leu His Lys Gly Thr Leu Thr Glu Ala Ile Asp
260 265 270

Met Lys Thr Ile His Lys Leu Thr
275 280

<210> 38 
<211> 229
<212> PRT

<213> Lactuca sativa
<400> 38

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe
1 5 10 15

Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Asn Val Leu Gly His
20 25 30

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Lys Asp Asn
35 40 45

Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
50 55 60

Leu Leu Leu Gln Gln Arg Ser Glu Thr Lys Val Thr Phe Pro Leu Val
65 70 75 80

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
85 90 95

Ile Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
100 105 110

Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Glu Phe
115 120 125

Thr Thr Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp
130 135 140

Gly Glu His Glu Val Asp Tyr Leu Leu Phe Leu Val Arg Asp Val Ala
145 150 155 160

Val Asn Pro Asn Pro Asp Glu Val Ala Asp Ile Arg Tyr Val Asn Gln
165 170 175

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly
180 185 190

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe
195 200 205

Lys Trp Trp Asp His Val Gln Lys Gly Thr Leu Asn Glu Ala Ile Asp
210 215 220

Met Lys Thr Ile His
225

<210> 39 
<211> 295 
<212> PRT 

<213> Adonis palaestina
<400> 39

Met Ser Ser Ile Arg Ile Asn Pro Leu Tyr Ser Ile Phe Ser Thr Thr
1 5 10 15

Thr Lys Thr Leu Ser Ala Ser Cys Ser Ser Pro Ala Val His Leu Gln
20 25 30

Gln Arg Cys Arg Thr Leu Ser Ile Ser Ser Ser Ile Thr Asn Ser Pro
35 40 45

Arg Arg Gly Leu Asn Arg Leu Phe Ala Ser Thr Ser Thr Met Gly Glu
50 55 60

Val Ala Asp Ala Gly Met Asp Ala Val Gln Lys Arg Leu Met Phe Asp
65 70 75 80

Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val Gly Tyr Asp
85 90 95

Ser Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala Glu Asn Leu
100 105 110

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu
115 120 125

Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val Trp
130 135 140

Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser Glu Leu Ile
145 150 155 160

Glu Glu Asn Phe Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu Leu
165 170 175

Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Glu Phe Thr
180 185 190

Pro Leu Gly Arg Ile Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly
195 200 205

Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Lys Tyr
210 215 220

Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val Asn Arg Glu
225 230 235 240

Glu Leu Lys Glu Ile Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly Ile
245 250 255

Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe Lys
260 265 270

Trp Trp Asp His Val Glu Glu Gly Lys Ile Lys Asp Val Ala Asp Met
275 280 285

Lys Thr Ile His Lys Leu Thr
290 295

<210> 40 
<211> 234 
<212> PRT 

<213> Adonis palaestina
<400> 40

Met Gly Glu Val Thr Asp Ala Gly Met Asp Ala Val Gln Lys Arg Leu
1 5 10 15

Met Phe Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val
20 25 30

Gly His Asp Ser Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala
35 40 45

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys
50 55 60

Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro
65 70 75 80

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser
85 90 95

Glu Leu Ile Glu Glu Asn Tyr Leu Gly Val Arg Asn Ala Ala Gln Arg
100 105 110

Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp
115 120 125

Glu Phe Thr Pro Leu Gly Arg Ile Leu Tyr Lys Ala Pro Ser Asp Gly
130 135 140

Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp
145 150 155 160

Val Lys Tyr Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val
165 170 175

Asn Arg Glu Glu Leu Arg Glu Ile Leu Arg Lys Ala Asp Ala Gly Glu
180 185 190

Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe
195 200 205

Leu Phe Lys Trp Trp Asp His Val Glu Gln Gly Thr Ile Lys Glu Val
210 215 220

Ala Asp Met Lys Thr Ile His Lys Leu Thr
225 230

<210> 41
<211> 238
<212> PRT

<213> Oryza sativa
<400> 41

Met Ala Gly Ala Ala Ala Ala Val Glu Asp Ala Gly Met Asp Glu Val
1 5 10 15

Gln Lys Arg Leu Met Phe Asp Asp Glu Cys Ile Leu Val Asp Glu Gln
20 25 30

Asp Asn Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu Met Glu
35 40 45

Lys Ile Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu
50 55 60

Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys
65 70 75 80

Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu
85 90 95

Tyr Arg Glu Ser Glu Leu Ile Gln Glu Asn Tyr Leu Gly Val Arg Asn
100 105 110

Ala Ala Gln Arg Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp
115 120 125

Val Pro Val Asp Gln Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala
130 135 140

Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe
145 150 155 160

Ile Val Arg Asp Val Lys Val Val Pro Asn Pro Asp Glu Val Ala Asp
165 170 175

Val Lys Tyr Val Ser Arg Glu Gln Leu Lys Glu Leu Ile Arg Lys Ala
180 185 190

Asp Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val
195 200 205

Val Asp Asn Phe Leu Met Gly Trp Trp Asp His Val Glu Lys Gly Thr
210 215 220

Leu Asn Glu Ala Val Asp Met Glu Thr Ile His Lys Leu Lys
225 230 235

<210> 42 
<211> 233 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 

Met Thr Asp Ser Asn Asp Ala Gly Met Asp Ala Val Gln Arg Arg Leu
1 5 10 15

Met Phe Glu Asp Glu Cys Ile Leu Val Asp Glu Asn Asn Arg Val Val
20 25 30

Gly His Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala
35 40 45

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys
50 55 60

Tyr Glu Leu Leu Leu Gln Gln Arg Ser Lys Thr Lys Val Thr Phe Pro
65 70 75 80

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser
85 90 95

Glu Leu Ile Glu Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gln Arg
100 105 110

Lys Leu Phe Asp Glu Leu Gly Ile Val Ala Glu Asp Val Pro Val Asp
115 120 125

Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly
130 135 140

Lys Trp Gly Glu His Glu Val Asp Tyr Leu Leu Phe Ile Val Arg Asp
145 150 155 160

Val Lys Leu Gln Pro Asn Pro Asp Glu Val Ala Glu Ile Lys Tyr Val
165 170 175

Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp
180 185 190

Glu Ala Val Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe
195 200 205

Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr Ile Thr Glu Ala
210 215 220

Ala Asp Met Lys Thr Ile His Lys Leu
225 230

<210> 43 
<211> 293 
<212> PRT

<213> Haematococcus pluvialis
<400> 43

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Leu Leu Ser Glu Asp Arg Thr Asp His Met Arg Gly
35 40 45

Ala Ser Thr Trp Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys
50 55 60

Asp Glu Cys Ile Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala
65 70 75 80

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu
85 90 95

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu
100 105 110

Leu Leu Gln Gln Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp
115 120 125

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu
130 135 140

Val Asp Gln Leu Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys
145 150 155 160

Ala Ala Ala Ile Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His
165 170 175

Gln Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys
180 185 190

Ala Ala Asp Val Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu
195 200 205

His Glu Met Asp Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala
210 215 220

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu
225 230 235 240

Leu Arg Gln Met Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp
245 250 255

Phe Arg Ile Ile Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu
260 265 270

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His
275 280 285

His Ile Asn Glu Ala
290

<210> 44 
<211> 304 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 44 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Met Thr Leu Met Gln Pro Ser Ile Ser Ala Asn Leu
35 40 45

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp
50 55 60

Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys Asp Glu Cys Ile
65 70 75 80

Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala Ser Lys Leu Glu
85 90 95

Cys His Lys Phe Leu Pro His Pro Ala Gly Leu Leu His Arg Ala Phe
100 105 110

Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu Leu Leu Gln Gln Arg
115 120 125

Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp Thr Asn Thr Cys Cys
130 135 140

Ser His Pro Leu His Gly Gln Thr Pro Asp Glu Val Asp Gln Leu Ser
145 150 155 160

Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala Ile Arg
165 170 175

Lys Leu Glu His Glu Leu Gly Ile Pro Ala His Gln Leu Pro Ala Ser
180 185 190

Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val Gln
195 200 205

Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu His Glu Met Asp Tyr
210 215 220

Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp Glu
225 230 235 240

Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu Leu Arg Gln Met Met
245 250 255

Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp Phe Arg Ile Ile Ala
260 265 270

Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu Asn
275 280 285

Thr Asp Lys His Glu Asp Trp Gly Thr Val His His Ile Asn Glu Ala
290 295 300

<210> 45
<211> 307
<212> PRT
<213> Chlamydomonas reinhardtii
<400> 45

Met Arg Ser Ser Phe Ile Glu Pro Lys Pro Arg Ala Gln Pro Val Leu
1 5 10 15

Ser Arg Gly Arg Ala Ser Met Arg Leu Ala Gln Ser Arg Ala Leu Val
20 25 30

Ala Arg Val Ser Ser Ala Leu Trp Pro Gly Ala Gly Leu Ser Gln Ala
35 40 45

Gln Ser Val Ala Val Arg Met Ala Ser Ser Ser Thr Trp Glu Gly Thr
50 55 60

Gly Leu Ser Gln Asp Asp Phe Met Gln Arg Asp Glu Cys Leu Val Val
65 70 75 80

Asp Glu Gln Asp Arg Leu Leu Gly Thr Ala Asn Lys Tyr Asp Cys His
85 90 95

Arg Phe Glu Ala Ala Lys Gly Gln Pro Cys Gly Arg Leu His Arg Ala
100 105 110

Phe Ser Val Phe Leu Phe Ser Pro Asp Gly Arg Leu Leu Leu Gln Gln
115 120 125

Arg Ala Ala Ser Lys Val Thr Phe Pro Gly Val Trp Thr Asn Thr Cys
130 135 140

Cys Ser His Pro Leu Ala Gly Gln Ala Pro Asp Glu Val Asp Leu Pro
145 150 155 160

Ala Ala Val Ala Ser Gly Gln Val Pro Gly Ile Lys Ala Ala Ala Val
165 170 175

Arg Lys Leu Gln His Glu Leu Gly Ile Pro Pro Gly Gln Val Pro Ala
180 185 190

Ser Ser Phe Ser Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Thr
195 200 205

Ala Thr His Gly Pro Ala Ala Glu Trp Gly Glu His Glu Val Asp Tyr
210 215 220

Val Leu Phe Val Arg Pro Gln Gln Pro Val Ser Leu Gln Pro Asn Pro
225 230 235 240

Asp Glu Val Asp Ala Thr Arg Tyr Val Thr Leu Pro Glu Leu Gln Ser
245 250 255

Met Met Ala Asp Pro Gly Leu Ser Trp Ser Pro Trp Phe Arg Ile Leu
260 265 270

Ala Thr Gln Pro Ala Phe Leu Pro Ala Trp Trp Gly Asp Leu Lys Arg
275 280 285

Arg Trp Arg Pro Gly Gly Ser Arg Leu Ser Asp Trp Gly Thr Ile His
290 295 300

Arg Val Met
305

<210> 46 
<211> 1848 
<212> DNA 

<213> Adonis palaestina 
<400> 46 

GAGAGAAAAA GAGTGTTATA TTAATGTTAC TGTCGCATTC TTGCAACACA TATTCAGACT 60
CCATTTTCTT GTTTTCTCTT CAAAACAACA AACTAATGTG ACGGAGTATC TAGCTATGGA 120
ACTACTTGGT GTTCGCAACC TCATCTCTTC TTGCCCTGTC TGGACTTTTG GAACAAGAAA 180
CCTTAGTAGT TCAAAACTAG CTTATAACAT ACATCGATAT GGTTCTTCTT GTAGAGTAGA 240
TTTTCAAGTG AGGGCTGATG GTGGAAGCGG GAGTAGAACT TCTGTTGCTT ATAAAGAGGG 300
TTTTGTGGAC GAGGAGGATT TTATCAAAGC TGGTGGTTCT GAGCTTTTGT TTGTCCAAAT 360
GCAGCAAACA AAGTCTATGG AGAAACAGGC CAAGCTCGCC GATAAGTTGC CACCAATACC 420
TTTCGGAGAA TCTGTGATGG ACTTGGTTGT AATAGGTTGT GGACCTGCTG GTCTTTCACT 480
GGCTGCAGAA GCTGCTAAGC TAGGCTTGAA AGTTGGCCTT ATTGGTCCTG ATCTTCCTTT 540
TACAAATAAT TATGGTGTGT GGGAAGACGA GTTCAAAGAT CTTGGACTTG AACGTTGTAT 600
CGAGCATGCT TGGAAGGACA CCATCGTATA TCTTGACAAT GATGCTCCTG TCCTTATTGG 660
TCGTGCATAT GGACGAGTTA GCCGGCATTT GCTGCATGAA GAGTTGCTGA AAAGGTGTGT 720
CGAGTCAGGT GTATCATATC TGAATTCTAA AGTGGAAAGG ATCACTGAAG CTGGTGATGG 780
CCATAGTCTT GTAGTTTGTG AAAACGACAT CTTTATCCCT TGCAGGCTTG CTACTGTTGC 840
ATCTGGAGCA GCTTCAGGGA AACTTTTGGA GTATGAAGTA GGTGGCCCTC GTGTTTGTGT 900
CCAAACTGCT TATGGTGTGG AGGTTGAGGT GGAGAACAAT CCATACGATC CCAACTTAAT 960
GGTATTTATG GACTACAGAG ACTATATGCA ACAGAAATTA CAGTGCTCGG AAGAAGAATA 1020
TCCAACATTT CTCTATGTCA TGCCCATGTC GCCAACAAGA CTTTTTTTTG AGGAAACCTG 1080
TTTGGCCTCA AAAGATGCCA TGCCTTTCGA TCTACTGAAG AGAAAACTAA TGTCACGATT 1140
GAAGACTCTG GGTATCCAAG TTACAAAAAT TTATGAAGAG GAATGGTCTT ATATTCCTGT 1200
TGGGGGTTCT TTACCAAACA CAGAGCAAAA GAACCTAGCA TTTGGTGCTG CAGCAAGCAT 1260
GGTGCATCCA GCAACAGGCT ATTCGGTTGT ACGATCACTA TCAGAAGCTC CAAAATATGC 1320
TTCTGTAATT GCAAAGATTT TGAAGCAAGA TAACTCTGCA TATGTGGTTT CTGGACAAAG 1380
CAGTGCAGTA AACATTTCAA TGCAAGCATG GAGCAGTCTT TGGCCAAAGG AGCGAAAACG 1440
TCAAAGAGCA TTCTTTCTTT TCGGGTTAGA GCTTATTGTG CAGCTAGATA TTGAAGCAAC 1500
CAGAACGTTT TTTAGAACCT TCTTCCGCTT GCCAACTTGG ATGTGGTGGG GTTTCCTTGG 1560
GTCTTCACTA TCATCTTTCG ATCTTGTATT GTTTTCCATG TACATGTTTG TTTTGGCCCC 1620
GAACAGCATG AGGATGTCAC TTGTGAGACA TTTGCTTTCA GATCCTTCTG GTGCAGTTAT 1680
GGTTAAAGCT TACCTCGAAA GGTAATCTGT TTTATGAAAC TATAGTGTCT CATTAAATAA 1740
ATGAGGATCC TTCGTATATG TATATGATCA TCTCTATGTA TATCCTATAT TCTAATCTCA 1800
TAAAGTAATC GAAAATTCAT TGATAGAAAA AAAAAAAAAA AAAAAAAA 1848

<210> 47
<211> 529
<212> PRT
<213> Adonis palaestina

<400> 47

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

WO 99/63035

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu
515 520 525

Arg

<210> 48 
<211> 378 
<212> PRT 
<213> Potato 

<400> 48 

Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile Glu His Val Trp
1 5 10 15

Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Asp Pro Ile Leu Ile Gly
20 25 30

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu
35 40 45

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp
50 55 60

Arg Ile Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly
65 70 75 80

Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala
85 90 95

Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Gly Pro Arg Val Ser Val
100 105 110

Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp
115 120 125

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp
130 135 140

Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro
145 150 155 160

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys
165 170 175

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu
180 185 190

Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Gly Glu Glu Trp Ser
195 200 205

Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu
210 215 220

Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser
225 230 235 240

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala
245 250 255

Asn Ile Leu Arg Gln Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser
260 265 270

Thr Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu
275 280 285

Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu
290 295 300

Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg
305 310 315 320

Val Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Ser Leu Ser Xaa
325 330 335

Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn
340 345 350

Asp Met Arg Arg Gly Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly
355 360 365

Ala Thr Leu Ile Arg Thr Tyr Leu Thr Phe
370 375



PCT/US99/12121 



<210> 49
<211> 524
<212> PRT
<213> Arabidopsis thaliana
<400> 49

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr
1 5 10 15

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr
20 25 30

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly
35 40 45

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe
50 55 60

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe
65 70 75 80

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val
85 90 95

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val
100 105 110

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala
115 120 125

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr
130 135 140

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln
145 150 155 160

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp
165 170 175

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg
180 185 190

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser
195 200 205

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu
210 215 220

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala
225 230 235 240

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val
245 250 255

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu
260 265 270

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr
275 280 285

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro
290 295 300

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu
305 310 315 320

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys
325 330 335

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys
340 345 350

Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro
355 360 365

Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val
370 375 380

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro
385 390 395 400

Lys Tyr Ala Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys
405 410 415

Gln Ile Asn Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro
420 425 430

Pro Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu
435 440 445

Ile Val Gln Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe
450 455 460

Phe Arg Leu Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu
465 470 475 480

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser
485 490 495

Pro Asn Asn Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro
500 505 510

Thr Gly Ala Thr Met Ile Lys Thr Tyr Leu Lys Val
515 520

<210> 50 
<211> 529 
<212> PRT 

<213> Adonis palaestina
<400> 50

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu
515 520 525

Arg



<210> 51
<211> 529
<212> PRT
<213> Adonis palaestina
<400> 51

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Thr Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asn Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Asp Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Ile
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Lys Ala Tyr Leu Glu
515 520 525

Arg



<210> 52 

<211> 533 

<212> PRT 

<213> Lettuce 

<400> 52 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe
1 5 10 15

Thr Cys Pro Arg Phe Thr Asp Cys Asn Ile Arg His Lys Phe Ser Leu
20 25 30

Leu Lys Gln Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arg
35 40 45

Gln Ile Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gln
50 55 60

Gly Ile Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser
65 70 75 80

Glu Leu Phe Phe Val Gln Met Gln Arg Thr Lys Ser Met Glu Ser Gln
85 90 95

Ser Lys Leu Ser Glu Lys Leu Ala Gln Ile Pro Ile Gly Asn Cys Ile
100 105 110

Leu Asp Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala
115 120 125

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu Ile Gly Pro Asp
130 135 140

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gln Asp Glu Phe Ile Gly
145 150 155 160

Leu Gly Leu Glu Gly Cys Ile Glu His Ser Trp Lys Asp Thr Leu Val
165 170 175

Tyr Leu Asp Asp Ala Asp Pro Ile Arg Ile Gly Arg Ala Tyr Gly Arg
180 185 190

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu
195 200 205

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala
210 215 220

Pro Asn Gly Tyr Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro
225 230 235 240

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu
245 250 255

Glu Tyr Glu Leu Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly
260 265 270

Ile Glu Val Glu Val Glu Asn Asn Pro Tyr Asp Pro Asp Leu Met Val
275 280 285

Phe Met Asp Tyr Arg Asp Phe Ser Lys His Lys Pro Glu Ser Leu Glu
290 295 300

Ala Lys Tyr Pro Thr Phe Leu Tyr Val Met Ala Met Ser Pro Thr Lys
305 310 315 320

Ile Phe Phe Glu Glu Thr Cys Leu Ala Ser Arg Glu Ala Met Pro Phe
325 330 335

Asn Leu Leu Lys Ser Lys Leu Met Ser Arg Leu Lys Ala Met Gly Ile
340 345 350

Arg Ile Thr Arg Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly
355 360 365

Gly Ser Leu Pro Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala
370 375 380

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu
385 390 395 400

Ser Glu Ala Pro Asn Tyr Ala Ala Val Ile Ala Lys Ile Leu Arg Gln
405 410 415

Asp Gln Ser Lys Glu Met Ile Ser Leu Gly Lys Tyr Thr Asn Ile Ser
420 425 430

Lys Gln Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gln Arg
435 440 445

Ala Phe Phe Leu Phe Gly Leu Ser His Ile Val Leu Met Asp Leu Glu
450 455 460

Gly Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp Met
465 470 475 480

Trp Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu Ile Ile
485 490 495

Phe Ala Leu Tyr Met Phe Val Ile Ala Pro His Ser Leu Arg Met Glu
500 505 510

Leu Val Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Met Val Lys
515 520 525

Ala Tyr Leu Thr Ile
530

<210> 53
<211> 526
<212> PRT
<213> Tomato

<400> 53

Met Glu Cys Val Gly Val Gln Asn Val Gly Ala Met Ala Val Leu Thr
1 5 10 15

Arg Pro Arg Leu Asn Arg Trp Ser Gly Gly Glu Leu Cys Gln Glu Lys
20 25 30

Ser Ile Phe Leu Ala Tyr Glu Gln Tyr Glu Ser Lys Cys Asn Ser Ser
35 40 45

Ser Gly Ser Asp Ser Cys Val Val Asp Lys Glu Asp Phe Ala Asp Glu
50 55 60

Glu Asp Tyr Ile Lys Ala Gly Gly Ser Gln Leu Val Phe Val Gln Met
65 70 75 80

Gln Gln Lys Lys Asp Met Asp Gln Gln Ser Lys Leu Ser Asp Glu Leu
85 90 95

Arg Gln Ile Ser Ala Gly Gln Thr Val Leu Asp Leu Val Val Ile Gly
100 105 110

Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly
115 120 125

Leu Asn Val Gly Leu Val Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr
130 135 140

Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile
145 150 155 160

Glu His Val Trp Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Glu Pro
165 170 175

Ile Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Phe Leu His
180 185 190

Glu Glu Leu Leu Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn
195 200 205

Ser Lys Val Asp Arg Ile Val Glu Ala Thr Asn Gly Gln Ser Leu Val
210 215 220

Glu Cys Glu Gly Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala
225 230 235 240

Ser Gly Ala Ala Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Ser Pro
245 250 255

Arg Val Ser Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn
260 265 270

Asn Pro Phe Asp Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr
275 280 285

Leu Arg His Asp Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu
290 295 300

Tyr Ala Met Pro Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys
305 310 315 320

Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu
325 330 335

Met Leu Arg Leu Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Glu
340 345 350

Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu
355 360 365

Gln Lys Thr Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala
370 375 380

Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala
385 390 395 400

Ser Val Leu Ala Asn Ile Leu Arg Gln His Tyr Ser Lys Asn Met Leu
405 410 415

Thr Ser Ser Ser Ile Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu
420 425 430

Trp Pro Gln Glu Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu
435 440 445

Ala Leu Ile Leu Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg
450 455 460

Ala Phe Phe Arg Val Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser
465 470 475 480

Ser Leu Ser Ser Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile
485 490 495

Ile Ala Pro Asn Asp Met Arg Lys Gly Leu Ile Arg His Leu Leu Ser
500 505 510

Asp Pro Thr Gly Ala Thr Leu Ile Arg Thr Tyr Leu Thr Phe
515 520 525

<210> 54 
<211> 516 
<212> PRT 

<213> Tagetes erecta 
<400> 54 

Met Ser Met Arg Ala Gly His Met Thr Ala Thr Met Ala Ala Phe Thr 
1 5 10 15 

Cys Pro Arg Phe Met Thr Ser Ile Arg Tyr Thr Lys Gln Ile Lys Cys
20 25 30

Asn Ala Ala Lys Ser Gln Leu Val Val Lys Gln Glu Ile Glu Glu Glu
35 40 45

Glu Asp Tyr Val Lys Ala Gly Gly Ser Glu Leu Leu Phe Val Gln Met
50 55 60

Gln Gln Asn Lys Ser Met Asp Ala Gln Ser Ser Leu Ser Gln Lys Leu
65 70 75 80

Pro Arg Val Pro Ile Gly Gly Gly Gly Asp Ser Asn Cys Ile Leu Asp
85 90 95

Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Gly Glu
100 105 110

Ser Ala Lys Leu Gly Leu Asn Val Ala Leu Ile Gly Pro Asp Leu Pro
115 120 125

Phe Thr Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Ile Gly Leu Gly
130 135 140

Leu Glu Gly Cys Ile Glu His Val Trp Arg Asp Thr Val Val Tyr Leu
145 150 155 160

Asp Asp Asn Asp Pro Ile Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser
165 170 175

Arg Asp Leu Leu His Glu Glu Leu Leu Thr Arg Cys Met Glu Ser Gly
180 185 190

Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala Pro Asn
195 200 205

Gly Leu Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro Cys Arg
210 215 220

Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr
225 230 235 240

Glu Leu Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Ile Glu
245 250 255

Val Glu Val Glu Ser Ile Pro Tyr Asp Pro Ser Leu Met Val Phe Met
260 265 270

Asp Tyr Arg Asp Tyr Thr Lys His Lys Ser Gln Ser Leu Glu Ala Gln
275 280 285

Tyr Pro Thr Phe Leu Tyr Val Met Pro Met Ser Pro Thr Lys Val Phe
290 295 300

Phe Glu Glu Thr Cys Leu Ala Ser Lys Glu Ala Met Pro Phe Glu Leu
305 310 315 320

Leu Lys Thr Lys Leu Met Ser Arg Leu Lys Thr Met Gly Ile Arg Ile
325 330 335

Thr Lys Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser
340 345 350

Leu Pro Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser
355 360 365

Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu
370 375 380

Ala Pro Asn Tyr Ala Ala Val Ile Ala Lys Ile Leu Gly Lys Gly Asn
385 390 395 400

Ser Lys Gln Met Leu Asp His Gly Arg Tyr Thr Thr Asn Ile Ser Lys
405 410 415 

Gln Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gln Arg Ala
420 425 430

Phe Phe Leu Phe Gly Leu Ala Leu Ile Val Gln Met Asp Ile Glu Gly
435 440 445

Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp
450 455 460

Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu Ile Ile Phe
465 470 475 480

Ala Phe Tyr Met Phe Ile Ile Ala Pro His Ser Leu Arg Met Gly Leu
485 490 495

Val Arg His Leu Leu Ser Asp Pro Thr Gly Gly Thr Met Leu Lys Ala
500 505 510

Tyr Leu Thr Ile
515 

<210> 55 
<211> 501 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 55 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Asp Phe Phe Ile Pro
1 5 10 15

Gln Phe His Gly Phe Glu Arg Leu Cys Ser Asn Asn Pro Tyr His Ser
20 25 30

Arg Val Arg Leu Gly Val Lys Lys Arg Ala Ile Lys Ile Val Ser Ser
35 40 45

Val Val Ser Gly Ser Ala Ala Leu Leu Asp Leu Val Pro Glu Thr Lys
50 55 60

Lys Glu Asn Leu Asp Phe Glu Leu Pro Leu Tyr Asp Thr Ser Lys Ser
65 70 75 80

Gln Val Val Asp Leu Ala Ile Val Gly Gly Gly Pro Ala Gly Leu Ala
85 90 95

Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp
100 105 110

Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp
115 120 125

Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr Thr Trp Ser
130 135 140

Gly Ala Val Val Tyr Val Asp Glu Gly Val Lys Lys Asp Leu Ser Arg
145 150 155 160

Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Leu Gln
165 170 175

Lys Cys Ile Thr Asn Gly Val Lys Phe His Gln Ser Lys Val Thr Asn
180 185 190

Val Val His Glu Glu Ala Asn Ser Thr Val Val Cys Ser Asp Gly Val
195 200 205

Lys Ile Gln Ala Ser Val Val Leu Asp Ala Thr Gly Phe Ser Arg Cys
210 215 220

Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala Tyr
225 230 235 240

Gly Ile Val Ala Glu Val Asp Gly His Pro Phe Asp Val Asp Lys Met
245 250 255

Val Phe Met Asp Trp Arg Asp Lys His Leu Asp Ser Tyr Pro Glu Leu
260 265 270

Lys Glu Arg Asn Ser Lys Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe
275 280 285

Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro
290 295 300

Gly Leu Arg Met Glu Asp Ile Gln Glu Arg Met Ala Ala Arg Leu Lys
305 310 315 320

His Leu Gly Ile Asn Val Lys Arg Ile Glu Glu Asp Glu Arg Cys Val
325 330 335

Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly
340 345 350

Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val
355 360 365

Ala Arg Thr Leu Ala Ala Ala Pro Ile Val Ala Asn Ala Ile Val Arg
370 375 380

Tyr Leu Gly Ser Pro Ser Ser Asn Ser Leu Arg Gly Asp Gln Leu Ser
385 390 395 400

Ala Glu Val Trp Arg Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg
405 410 415

Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Asp
420 425 430

Ala Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Gln Pro His Tyr
435 440 445

Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu Val
450 455 460

Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu
465 470 475 480

Ile Met Thr Lys Gly Thr Val Pro Leu Ala Lys Met Ile Asn Asn Leu
485 490 495

Val Gln Asp Arg Asp
500 

<210> 56 
<211> 502 
<212> PRT 

<213> Adonis palaestina 
<400> 56 

Met Asp Thr Leu Leu Arg Thr His Asn Lys Leu Glu Leu Leu Pro Thr 
1 5 10 15 

Leu His Gly Phe Ala Glu Lys Gln His Leu Val Ser Thr Ser Lys Leu
20 25 30

Gln Asn Gln Val Phe Arg Ile Ala Ser Arg Asn Ile His Pro Cys Arg
35 40 45

Asn Gly Thr Val Lys Ala Arg Gly Ser Ala Leu Leu Glu Leu Val Pro
50 55 60

Glu Thr Lys Lys Glu Asn Leu Glu Phe Asp Leu Pro Ala Tyr Asp Pro
65 70 75 80

Ser Arg Gly Ile Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala
85 90 95

Gly Leu Ala Ile Ala Gln Gln Val Ser Glu Ala Gly Leu Leu Val Cys
100 105 110

Ser Ile Asp Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val
115 120 125

Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr
130 135 140

Thr Trp Ser Gly Ala Val Val Tyr Thr Asp Asp Asn Ser Lys Lys Tyr
145 150 155 160

Leu Asp Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys
165 170 175

Met Leu Gln Lys Cys Val Thr Asn Gly Val Lys Phe His Gln Ala Lys
180 185 190

Val Ile Lys Val Ile His Glu Glu Ser Lys Ser Leu Leu Ile Cys Asn
195 200 205

Asp Gly Ile Thr Ile Asn Ala Thr Val Val Leu Asp Ala Thr Gly Phe
210 215 220

Ser Arg Cys Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln
225 230 235 240

Val Ala Tyr Gly Ile Met Ala Glu Val Glu Glu His Pro Phe Asp Leu
245 250 255

Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser His Leu Asn Glu Lys
260 265 270

Leu Glu Leu Lys Asp Lys Asn Arg Lys Ile Pro Thr Phe Leu Tyr Ala
275 280 285

Met Pro Phe Ser Ser Thr Lys Ile Phe Leu Glu Glu Thr Ser Leu Val
290 295 300

Ala Arg Pro Gly Leu Arg Phe Glu Asp Ile Gln Glu Arg Met Val Ala
305 310 315 320

Arg Leu Lys His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu
325 330 335

Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg
340 345 350

Val Val Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly
355 360 365

Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Lys Ser
370 375 380

Ile Val Gln Tyr Leu Gly Ser Asp Arg Ser Leu Ser Gly Asn Glu Leu
385 390 395 400

Ser Ala Glu Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln
405 410 415

Arg Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu
420 425 430

Gln Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro His
435 440 445

Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu
450 455 460

Phe Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Ala Ser Arg Ile
465 470 475 480

Glu Ile Met Ala Lys Gly Thr Val Pro Leu Val Asn Met Met Asn Asn
485 490 495

Leu Ile Gln Asp Thr Asp
500 



<210> 57 
<211> 498 
<212> PRT 
<213> Pepper 



<400> 57 

Met Asp Thr Leu Leu Arg Thr Pro Asn Asn Leu Glu Phe Leu His Gly 
1 5 10 15 

Phe Gly Val Lys Val Ser Ala Phe Ser Ser Val Lys Ser Gln Lys Phe 
20 25 30 

Gly Ala Lys Lys Phe Cys Glu Gly Leu Gly Ser Arg Ser Val Cys Val 
35 40 45 

Lys Ala Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys 
50 55 60 

Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val 
65 70 75 80 

Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val 
85 90 95 

Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp Pro 
100 105 110 

Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu 
115 120 125 

Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly 
130 135 140 

Ala Ala Val Tyr Ile Asp Asp Lys Thr Thr Lys Asp Leu Asn Arg Pro 
145 150 155 160 

Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met Gln Lys 
165 170 175 

Cys Ile Leu Asn Gly Val Lys Phe His Gln Ala Lys Val Ile Lys Val 
180 185 190 

Ile His Glu Glu Ser Lys Ser Met Leu Ile Cys Asn Asp Gly Ile Thr 
195 200 205 

Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg Ser Leu 
210 215 220 

Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala Tyr Gly 
225 230 235 240 

Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys Met Val 
245 250 255 

Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Val Glu Leu Lys 
260 265 270 

Glu Arg Asn Ser Arg Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser 
275 280 285 

Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly 
290 295 300 

Leu Gly Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu Ser His 
305 310 315 320 

Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys Val Ile 
325 330 335 

Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly Ile 
340 345 350 

Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala 
355 360 365 

Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile Gln Tyr 
370 375 380 

Leu Ser Ser Glu Arg Ser His Ser Gly Asp Glu Leu Ser Ala Ala Val 
385 390 395 400 

Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe 
405 410 415 

Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala Thr Arg 
420 425 430 

Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly 
435 440 445 

Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Ile Val Phe Gly Leu 
450 455 460 

Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu Ile Met Thr 
465 470 475 480 

Lys Gly Thr Leu Pro Leu Val His Met Ile Asn Asn Leu Leu Gln Asp 
485 490 495 

Lys Glu 



<210> 58 

<211> 500 

<212> PRT 

<213> Tomato 

<400> 58 

Met Asp Thr Leu Leu Lys Thr Pro Asn Asn Leu Glu Phe Leu Asn Pro 
1 5 10 15 

His His Gly Phe Ala Val Lys Ala Ser Thr Phe Arg Ser Glu Lys His 
20 25 30 

His Asn Phe Gly Ser Arg Lys Phe Cys Glu Thr Leu Gly Arg Ser Val 
35 40 45 

Cys Val Lys Gly Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr 
50 55 60 

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys 
65 70 75 80 

Gly Val Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu 
85 90 95 

Ala Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile 
100 105 110 

Asp Pro Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val 
115 120 125 

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp 
130 135 140 

Ser Gly Ala Ala Val Tyr Ile Asp Asp Asn Thr Ala Lys Asp Leu His 
145 150 155 160 

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met 
165 170 175 

Gln Lys Cys Ile Met Asn Gly Val Lys Phe His Gln Ala Lys Val Ile 
180 185 190 

Lys Val Ile His Glu Glu Ser Lys Ser Met Leu Ile Cys Asn Asp Gly 
195 200 205 

Ile Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg 
210 215 220 

Ser Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala 
225 230 235 240 

Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys 
245 250 255 

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Thr Asp 
260 265 270 

Leu Lys Glu Arg Asn Ser Arg Ile Pro Thr Phe Leu Tyr Ala Met Pro 
275 280 285 

Phe Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg 
290 295 300 

Pro Gly Leu Arg Ile Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu 
305 310 315 320 

Asn His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys 
325 330 335 

Leu Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val 
340 345 350 

Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met 
355 360 365 

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile 
370 375 380 

Gln Tyr Leu Gly Ser Glu Arg Ser His Ser Gly Asn Glu Leu Ser Thr 
385 390 395 400 

Ala Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu 
405 410 415 

Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala 
420 425 430 

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp 
435 440 445 

His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Ile Val Phe 
450 455 460 

Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Phe Glu Ile 
465 470 475 480 

Met Thr Lys Gly Thr Val Pro Leu Val Asn Met Ile Asn Asn Leu Leu 
485 490 495 

Gln Asp Lys Glu 
500 



<210> 59 

<211> 500 

<212> PRT 

<213> Tobacco 

<400> 59 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Glu Phe Leu His Pro 
1 5 10 15 

Val His Gly Phe Ser Val Lys Ala Ser Ser Phe Asn Ser Val Lys Pro 
20 25 30 

His Lys Phe Gly Ser Arg Lys Ile Cys Glu Asn Trp Gly Lys Gly Val 
35 40 45 

Cys Val Lys Ala Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr 
50 55 60 

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys 
65 70 75 80 

Gly Leu Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu 
85 90 95 

Ala Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Val Ser Ile 
100 105 110 

Asp Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val 
115 120 125 

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp 
130 135 140 

Ser Gly Thr Val Val Tyr Ile Asp Asp Asn Thr Thr Lys Asp Leu Asp 
145 150 155 160 

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met 
165 170 175 

Gln Lys Cys Ile Leu Asn Gly Val Lys Phe His His Ala Lys Val Ile 
180 185 190 

Lys Val Ile His Glu Glu Ala Lys Ser Met Leu Ile Cys Asn Asp Gly 
195 200 205 

Val Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg 
210 215 220 

Cys Leu Val Gln Tyr Asp Lys Pro Tyr Lys Pro Gly Tyr Gln Val Ala 
225 230 235 240 

Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Thr Ser Lys 
245 250 255 

Met Val Leu Met Asp Trp Arg Asp Ser His Leu Gly Asn Asn Met Glu 
260 265 270 

Leu Lys Glu Arg Asn Arg Lys Val Pro Thr Phe Leu Tyr Ala Met Pro 
275 280 285 

Phe Ser Ser Asn Lys Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg 
290 295 300 

Pro Gly Leu Arg Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu 
305 310 315 320 

Asn His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys 
325 330 335 

Val Ile Pro Met Gly Gly Ser Leu Pro Val Ile Pro Gln Arg Val Val 
340 345 350 

Gly Thr Gly Gly Thr Ala Gly Leu Val His Pro Ser Thr Gly Tyr Met 
355 360 365 

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile 
370 375 380 

His Tyr Leu Gly Ser Glu Lys Asp Leu Leu Gly Asn Glu Leu Ser Ala 
385 390 395 400 

Ala Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu 
405 410 415 

Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala 
420 425 430 

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp 
435 440 445 

His Gly Phe Leu Ser Ser Arg Leu Tyr Leu Pro Glu Leu Ile Phe Phe 
450 455 460 

Gly Leu Ser Leu Phe Ser Arg Ala Ser Asn Thr Ser Arg Ile Glu Ile 
465 470 475 480 

Met Thr Lys Gly Thr Leu Pro Leu Val Asn Met Ile Asn Asn Leu Leu 
485 490 495 

Gln Asp Thr Glu 
500 

<210> 60 
<211> 511 
<212> PRT 

<213> Tagetes erecta 
<400> 60 

Met Asp Thr Phe Leu Arg Thr Tyr Asn Ser Phe Glu Phe Val His Pro 
1 5 10 15 

Ser Asn Lys Phe Ala Gly Asn Leu Asn Asn Leu Asn Gln Leu Asn Gln 
20 25 30 

Ser Lys Ser Gln Phe Gln Asp Phe Arg Phe Gly Pro Lys Lys Ser Gln 
35 40 45 

Phe Lys Leu Gly Gln Lys Tyr Cys Val Lys Ala Ser Ser Ser Ala Leu 
50 55 60 

Leu Glu Leu Val Pro Glu Ile Lys Lys Glu Asn Leu Asp Phe Asp Leu 
65 70 75 80 

Pro Met Tyr Asp Pro Ser Arg Asn Val Val Val Asp Leu Val Val Val 
85 90 95 

Gly Gly Gly Pro Ser Gly Leu Ala Val Ala Gln Gln Val Ser Glu Ala 
100 105 110 

Gly Leu Thr Val Cys Ser Ile Asp Pro Ser Pro Lys Leu Ile Trp Pro 
115 120 125 

Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu 
130 135 140 

Asp Cys Leu Asp Thr Thr Trp Ser Ser Ala Val Val Tyr Ile Asp Glu 
145 150 155 160 

Lys Ser Thr Lys Ser Leu Asn Arg Pro Tyr Ala Arg Val Asn Arg Lys 
165 170 175 

Gln Leu Lys Thr Lys Met Leu Gln Lys Cys Ile Ala Asn Gly Val Lys 
180 185 190 

Phe His Gln Ala Lys Val Ile Lys Val Ile His Glu Glu Leu Lys Ser 
195 200 205 

Leu Leu Ile Cys Asn Asp Gly Val Thr Ile Gln Ala Thr Leu Val Leu 
210 215 220 

Asp Ala Thr Gly Phe Ser Arg Ser Leu Val Gln Tyr Asp Lys Pro Tyr 
225 230 235 240 

Asn Pro Gly Tyr Gln Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu 
245 250 255 

His Pro Phe Asp Val Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser 
260 265 270 

His Leu Asp Gln Asn Leu Glu Ile Lys Ala Arg Asn Ser Arg Ile Pro 
275 280 285 

Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser Thr Arg Ile Phe Leu Glu 
290 295 300 

Glu Thr Ser Leu Val Ala Arg Pro Gly Leu Lys Met Glu Asp Ile Gln 
305 310 315 320 

Glu Arg Met Ala Tyr Arg Leu Lys His Leu Gly Ile Lys Val Lys Ser 
325 330 335 

Ile Glu Glu Asp Glu Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro 
340 345 350 

Val Leu Pro Gln Arg Val Leu Gly Ile Gly Gly Thr Ala Gly Met Val 
355 360 365 

His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro 
370 375 380 

Ile Val Ala Lys Ser Ile Ile Arg Tyr Leu Asn Asn Glu Lys Ser Met 
385 390 395 400 

Val Ala Asp Val Thr Gly Asp Asp Leu Ala Ala Gly Ile Trp Arg Glu 
405 410 415 

Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe Cys Phe Gly 
420 425 430 

Met Asp Ile Leu Leu Lys Leu Asp Leu Glu Gly Thr Arg Arg Phe Phe 
435 440 445 

Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser 
450 455 460 

Ser Arg Leu Phe Leu Pro Glu Leu Val Thr Phe Gly Leu Ser Leu Phe 
465 470 475 480 

Gly His Ala Ser Asn Thr Cys Arg Val Glu Ile Met Ala Lys Gly Thr 
485 490 495 

Leu Pro Leu Ala Thr Met Ile Gly Asn Leu Val Arg Asp Arg Glu 
500 505 510 

<210> 61 
<211> 503 
<212> PRT 
<213> Daffodil 

<400> 61 

Met Asp Thr Leu Leu Arg Thr His Asn Arg Leu Glu Leu Leu Tyr Pro 
1 5 10 15 

Leu His Glu Leu Ala Lys Arg His Phe Leu Ser Pro Ser Pro Asn Pro 
20 25 30 

Gln Asn Pro Asn Phe Lys Phe Phe Ser Arg Lys Pro Tyr Gln Lys Lys 
35 40 45 

Cys Arg Asn Gly Tyr Ile Gly Val Ser Ser Asn Gln Leu Leu Asp Leu 
50 55 60 

Val Pro Glu Ile Lys Lys Glu His Leu Glu Phe Asp Leu Pro Leu Tyr 
65 70 75 80 

Asp Pro Ser Lys Ala Leu Thr Leu Asp Leu Ala Val Val Gly Gly Gly 
85 90 95 

Pro Leu Ala Arg Ser Cys Ser Thr Ser Leu Gly Gly Gly Leu Ser Val 
100 105 110 

Val Ser Ile Asp Pro Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly 
115 120 125 

Val Trp Val Asp Glu Phe Glu Asp Met Asp Leu Leu Asp Cys Leu Asp 
130 135 140 

Ala Thr Trp Ser Gly Ala Ile Val Tyr Val Asp Asp Arg Ser Thr Lys 
145 150 155 160 

Asn Leu Ser Arg Pro Tyr Ala Arg Val Asn Arg Lys Asn Leu Lys Ser 
165 170 175 

Lys Met Met Lys Lys Cys Val Ser Asn Gly Val Arg Phe His Gln Ala 
180 185 190 

Thr Val Val Lys Ala Met His Glu Glu Glu Lys Ser Tyr Leu Ile Cys 
195 200 205 

Ser Asp Gly Val Thr Ile Asp Ala Arg Val Val Leu Asp Ala Thr Gly 
210 215 220 

Phe Ser Arg Cys Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr 
225 230 235 240 

Gln Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp 
245 250 255 

Val Asp Lys Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Gly 
260 265 270 

Lys Ala Glu Leu Asn Glu Arg Asn Ala Lys Ile Pro Thr Phe Leu Tyr 
275 280 285 

Ala Met Pro Phe Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu 
290 295 300 

Val Ala Arg Pro Gly Leu Lys Met Glu Asp Ile Gln Glu Arg Met Val 
305 310 315 320 

Ala Arg Leu Asn His Leu Gly Ile Arg Ile Lys Ser Ile Glu Glu Asp 
325 330 335 

Glu Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro Val Ile Pro Gln 
340 345 350 

Arg Val Val Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr 
355 360 365 

Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro Ile Val Ala Asn 
370 375 380 

Ser Ile Val Gln Tyr Leu Val Ser Asp Ser Gly Leu Ser Gly Asn Asp 
385 390 395 400 

Leu Ser Ala Asp Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg 
405 410 415 

Gln Arg Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp 
420 425 430 

Leu Glu Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro 
435 440 445 

Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu 
450 455 460 

Val Pro Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Cys Lys 
465 470 475 480 

Leu Glu Ile Met Ala Lys Gly Thr Leu Pro Leu Val Asn Met Ile Asn 
485 490 495 

Asn Leu Val Gln Asp Arg Asp 
500 